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QUANTITATIVE ANALYSES OF PSYCHOANALYTIC 
STUDIES OF SHAKESPEARE* 


Department of Psychology, State University of New York, Brockport 


M. S. LINDAUER 


“Whenever you can, count"—Francis Galton. 
"You may prove anything by figures"—Thomas Carlyle. 
"T am ill at reckoning”—Shakespeare. 


A. INTRODUCTION 


Holland's extensive review and analysis of psychoanalytic studies of Shake- 
speare (4) convincingly argues for the contribution of psychoanalysis to lit- 
“The insights of psychoanalysis complement and fulfill those of con- 
(4, p. 347). Yet Holland's effort, sophisticated and 
articulate as it is, is more of an artistic and literary performance than a sci- 
entific and objective accomplishment. Аз a man of literature and not of sci- 
ence, Holland's approach is justified and perhaps appropriate in handling the 
quasi-scientific/artistic status of psychoanalysis. “Psychoanalysis represents a 
most important strategic bridge from the rigorous, external aims of the sci- 
ences, and those internal aims of the humanities, less rigorous but more sig- 
Wrap and, if you will, more practical" (4, p. 348). 

The absence of the scientific method has been noted by Plank in his review 


of Holland's book (15) as the serious flaw in psychoanalytic studies of Shake- 
р Е the role of psychoanalysis shares this short- 


speare, Holland’s treatment 0 руси заге s short 
coming. In particular, it is the absence of еруге analysis, the scientist's 
tool of exactness and precision, which limits ше à ormation that could be 
abstracted from masses of raw data. Statistical techniques systematically de- 
scribe and relate data, and allow inductive statements regarding the probabil- 
ity of events to be made: i.e, whether observations are due merely to chance 
or to significant factors. Questions can thereby E D with some assur- 
ance, and promising new inquiries can be initatec. “па sense, such a strategy 


ч : “Ве sure of it; give me ocular proof" 
follows Shakespeare's own dictum à á 


MEUM. mee psychologists (in contra.c to psychoanalysts and 
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clinical psychologists) have sympathetically seen an interrelationship between 
psychology and literature (1, 2, 8). Most psychologists have viewed the re- 
lationship between the two subjects as one of incompatibility, alienation, and 
failure (6, 17, 20), mainly because of literature’s unsusceptibility to standard 
technical analyses. Nevertheless, some attempts have been made to apply quan- 
titative methods to the study of literature (9, 10, 11, 18), including Shake- 
speare (12, 19). Yet even one favorably disposed to such an analysis has been 
less than enthusiastic over the results (15, p. 202). 

The quantitative approach to qualitative material evokes the controversy 
between the statistical versus the clinical approach to man. Miller (14) has 
posed the disparity between these two strategies in this manner: 

"For several years there has been a running battle between the clinical heirs 
of Sigmund Freud and the statistical heirs of Sir Francis Galton, The Freud- 
ians learn about people by talking to them; the Galtonians give tests and 
compute statistics. When both groups are not busy doing this, they like to 
spend their time criticizing each other" (14, p. 315). 

Miller further illustrates this dilemma as follows (14, p. 317) : clinicians 
attribute to their method such terms as "dynamic, rich, genuine," while they 
attach to the statistical method the labels "mechanical, artificial, trivial." On 
the other hand, statisticians call themselves “verifiable, reliable, testable” and 
claim the clinicians are "mystical, primitive, muddleheaded.” 

The purpose of this paper is to investigate an 
found within the context of its analy: 
land. He focused on studies of Shake 
and the creativ 


aspect of psychoanalysis, as 
sis of Shakespeare, not pursued by Hol- 
speare the man, his works, the audience, 
е process. But Holland missed the opportunity to disclose some- 
thing of psychoanalysis, as its interests are revealed in the study of Shake- 


speare. The present inquiry, using quantitative techniques, is addressed to this 
issue, 


B. METHOD 


The subject index and listings of books and 
bibliography of psychological studies of literature (7) were examined for ref- 
erences to Shakespearean plays and characters, as well as psychological traits 
dealt with in the plays and characters. The index was considered a reliable 
indicator of the main emphasis and concern of a study, since the citation was 
based not only on the title of the work, but on the editor’s reading of either 


the reference itself, its abstract, or a review (7, p. 5). Although Shakespearean 


material may have been mentioned in Psychological works other than those 
included in this survey, omission from the index of a study would indicate that 


articles in Kiell’s exhaustive 


M. S. LINDAUER 5 


treatment of Shakespeare in the study was peripheral to some other topic or 
issue.! The psychoanalytic status of a work, generally defined so as to include 
all varieties and schools of depth analysis, was determined either through the 
index reference or by a reading of the title. While a few nonpsychoanalytical 
works may therefore have been accidentally included, over 98 per cent of the 
studies could be rather easily categorized as psychoanalytic. In addition, early 
(pre-Freud) writings were also included, so as to serve as a base for later 


comparisons in the trends-over-time analysis. 
C. RESULTS 


1. Preferences for Plays 


rote 37 plays (16), 65 per cent or 24 of which have been 
n detail by psychoanalysts in 196 publications. With 84 
additional studies concentrating on Shakespeare himself, psychoanalysis appears 
disproportionately more interested in the man than in any one category of his 
works (x? = 4.00, df = 1, p < 05 > 01 ). Arranging the plays of interest 
to psychoanalysis into the categories Comedy, Tragedy, and History (5), the 
24 plays are distributed as follows: 9, 11, and 4, respectively (out of a total 
possible breakdown of 14, 13, and 10, respectively)? Although it would ap- 
pear that historical and comic categories were of less interest than tragedies, 
the differences are not statistically significant (x? = 0.06, df = 1, р > .05). 
Nor is there any statistical difference, in terms of these three categories, be- 
tween the number of plays written by Shakespeare and the number studied by 
psychoanalysts (Х2 = 1.76, df = 1,6 > .05). However, with respect to the 
number of studies devoted to plays in these three categories, more effort was 
spent on the tragedies (79 per cent of the 196 studies) than on the comedies 
(15 per cent) or histories (6 per cent) ; these differences are statistically sig- 
nificant (x2 = 164.67, df = lp< 101). Of the tragedies, Hamlet is easily 
the most preferred, accounting for 58 per cent. Together with Macbeth, 


Othello and King Lear, these four tragedies account for 72 per cent of all 


studies. 
2, Time Trends 
Fifty per cent of all psychoanalytic studies of Shakespeare and his works 
бызге ud since 1943. "This intcrest 18 increasing: 3 per cent, 9 per cent, 
16 per cent, 16 per cent 23 per cent and 27 per cent of all studies, respectively, 
3 


eements between 5 
d psychologic 


Shakespeare w: 
specifically studied i 


n Kiell (7) and Holland's (4) 
al material which made any 


disagr! 
iy include 


ere all in the Henry series. 


1 This helps to account for 
Coverage, since the latter apparen 
reference to Shakespeare. , -— 

2 The six omitted historical plays 
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were published in six 10-year intervals beginning in 1903. This overall trend 
is also reflected in the subtotals that refer to the three categories of plays, as 
well as those studies concentrating on Shakespeare. Comedies and histories 
have shown the greatest spurt in the most immediate decade (1953-1962) : 
31 per cent of the former type and 54 per cent of the latter were published 
then. Studies of tragedies and of Shakespeare have shown relative stability 
over the past two-three decades. Over the entire time span, studies of tragedy 
have outnumbered those of Shakespeare by at least if not more than 2:1 (in 
1953-1962, the percentages were 55:25) except for 1933-1942 (42:40) and 
a period before Freud, 1851-1902 (38:62). Finally, psychoanalysts have not 
Shown any special interest in plays written in any one particular period of 
Shakespeare's career. Thus, plays written during Shakespeare's early, middle, 
and late years (16), received about equal amounts of attention, as shown by 


the lack of a significant correlation between date of authorship and frequency 
of analytic studies (ryn = —.13, > .05). 


3. The Characters Studied 


There are 325 characters in Shakespeare's plays (13).3 This is an average 
(mean) of 8.78 characters per play: historical plays have the most (10.80) 
and tragic plays the fewest characters (7.61). In 33 studies, psychoanalysts 
have specifically focused on the characters in 12 plays, concentrating on only 


about 6 per cent of the total possible. Interest has been particularly directed 
toward the tragic figures (x2 — 10.12, df = 1, p < .01), 


accounting for 73 
per cent, 50 per cent, and 63 per cent of the above studies, plays, and charac- 
ters, respectively. 


4. The Traits 

In 136 studies, psychoanalytic authors s 
ical traits, about half or 28 of which re 
(e.g., insanity), with the remainder abo 
gories "symbols" (myths), “processes” 
and “developmental” (psychosexual sta 
traits (in six-11 studies), in decreasin 
Oedipus complex, dreams, and fantasy 


pecifically emphasized 60 psycholog- 
ferred to “psychopathological states” 
ut equally divided between the cate- 
(the unconscious), “motives” (love), 
ges). The most frequently referred to 
g order, were madness, homosexuality, 


| - The plays arranged by category, and 
the studies of Shakespeare, showed no unexpected differences in the distribu- 


tion of these traits, except for "psychopathology" which was referred to more 


often in tragedies and less often in Shakespeare than expected (y? — 15.08, 
4#==З„ф < 01). 


5 Titled characters—e.g., “courtesan”—were excluded from the count. 
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5. Freud and Other Analysts 


In his writings Freud specifically referred to about half (18) of Shake- 
speare's plays (4, pp. 359-365). In terms of the three categories into which 
Shakespeare's plays can be placed, no one type predominated in Freud's writ- 
ings, nor do the categories of plays studied by Freud differ from those not 
studied by him (x? — 1.67, 4f — 2, p > .05). Compared to plays studied by 
psychoanalysts in general, Freud made no mention of 14 plays noted by other 
analysts. Particularly noteworthy, in that six studies were devoted to each, 
All’s Well That Ends Well, and The Winter’s Tale; Freud 
play not studied by other analysts (Love’s Labour’s Lost). 
Both Freud and the analysts jointly referred to 18 plays (4, pp. 371-390), 
able agreement in the attention given to them: the difference 
he number of studies associated with each play (arranged 
(t = 0.18, df = 17, р > .05); e.g, Hamlet 
t attention from both Freud and the analysts, 
othing received little, and Julius Caesar, 
d intermediate amounts of interest. 


are Coriolanus, 
referred to one 


showing remark 
between the two in t 
into ranks) was not significant 
and Macbeth received the тоз 
Henry 6 and. Much Ado About М i 
Richard 3, and Romeo and Juliet receive 


D. Discussion AND CONCLUSIONS 


"The following picture of psychoanalysis emerges from a quantitative exam- 
ination of its treatment of Shakespeare and his works. More interested in the 
man than in any one category of his works, psychoanalysts have studied a rep- 
resentative proportion but not nearly all of Shakespeare's plays. Of those 

stinctions between works produced 


i i i kes no di 
stu le psychoanalysis ma 
died, while psy between the comic, tragic, and his- 


t 

at various periods in Shakespeare 5 life, or : | 

torical categories of plays, not all plays have received the same consideration: 
ical ca А 


t a ferred. Of the tragedies studied, not all have received the 
ragedies are preierre- "This somewhat narrow and confined approach to the 
s : nei ob ae ychoanglyst concentration on a small number of charac- 

a a ith tragic figures receiving the most emphasis. Psychoana- 
ters in the plays, wit! le exceptions paralleled by Freud 


: tab. 

"m ‘ch are with some no : ; 

p „к= е кай frequently in relatively recent years, with some 
self, are а 


categories of plays showing à leveling of ше while e ы. i vae 
i ic plays, with 2 few exceptions, ave consistently been of the 
increase. Tragic plays, he years: Finally, psychoanalysts have concentrated 
greatest interest oveT ше i the "normal" aspects of Freud's contribution 
more on the morbi | ан icularly characteristic of the tragedies. 

to the study of man, +. derived from quantitative analyses, raise 


These aspects of PSYC 
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certain questions that are not within the province of this study to answer, but 
which presumably are more amenable to literary rather than quantitative at- 
tack. Firstly, what reasons can be ascribed to the relative if not absolute ne- 
glect by psychoanalysis of many of Shakespeare's plays, as well as characters? 
Related to this question is the glaring disparity between those few plays solely 
of interest to psychoanalysts at large but not, evidently, to Freud. Are some 
plays of less literary merit; or do they lack psychoanalytic implications? Sec- 
ondly, the great interest evidenced in the tragedies may not be surprising, but 
why the neglect of the comedies? Freud has indicated the importance of humor 
in understanding the deep-seated forces in man (3). For that matter, Freud's 
interest in the role of the past in shaping the present and the future is perhaps 
contradicted by the slighting by psychoanalysis of the historical plays. In the 
same sense, the lack of any special concern with Shakespeare's early writings 
seems surprising. More of the man and his antecedents might be revealed more 
clearly in the early rather than later productions. Finally, psychoanalysis is a 
theory not just of neurosis but of the normal personality as well. Yet psycho- 
analysts—as is perhaps true of most authors, as well as the man-in-the-street— 
have ignored the broader aspects of Freud in order to focus on the abnormal. 
It is hoped that this quantitative approach may be evaluated in light of the 
questions answered and those raised—questions that might not have been an- 
swered or could not have been raised by a qualitative approach. However, the 
ease with which figures may be statistically manipulated is a danger, especially 
if data are forced into convenient molds for purposes of measurement rather 
than for their usefulness and clarity, 
There is no more common error than to assume that, 


and accurate mathematical measurements have been 
tion of the result to some f 


Whitehead. 


because prolonged 
made, the applica- 
act of nature is absolutely certain—A, N. 


E. Summary 

The application of statistical techniques to psycho. 

speare and his works reveals the preferences of psyc 

acters, and traits; trends over time; and similaritie 
Freud and other analysts. Questions about some 

omissions of psychoanalysis in this area are raised o 
as well as the role of quantification in the psycholo 


analytic studies of Shake- 
hoanalysis for plays, char- 
s and distinctions between 
of the preoccupations and 
n the basis of the analysis, 
gical study of literature. 
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T-GROUP MEMBER RATINGS OF LEADER AND SELF IN 
A HUMAN RELATIONS LABORATORY*? 


Department о} Educational Psychology, The University of Texas; and 
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EARL A. Коше AND CAROLYN DRAEGER 


A. PURPOSES 


Changes in human behavior continue to be extremely difficult to measure 
and yet intriguing to discover, whether through carefully designed research 
or by simple observation. The present study focuses on change. ]t is an inves- 
tigation of how members of a human relations laboratory change in their 
perceptions of their group leaders during different T-Group (training group) 
sessions and in their perceptions of themselves as the laboratory experience 
progresses. More specifically the study investigates whether or not (а) mem- 
bers of T Groups and their group leaders become more like each other during 
the laboratory, (4) members see their leaders differently from session to 
session, (¢) members become more like each other in the ways they rate their 
leaders as the T Groups continue to meet, (4) member ratings of T-Group 
leaders become more like the leader’s ratings of himself as the sessions con- 


tinue, and (e) whether or not members tend to rate themselves differently 


at the end as compared to the beginning of the human relations laboratory 


experience. 
B. METHOD 

study consisted of participants in a four-day human 
graduates at the University of Texas at 
Austin? The 44 group members met together for eight general sessions con- 
ducted by the laboratory director, and were divided into four 'T Groups for 
six sessions with four experienced T-Group leaders. m А 
'The laboratory general sessions and Т Groups were experientially oriented 
toward the personal growth of members. "Theoretical implications and gen- 


The sample for the 


relations laboratory held for under 


= = seria] Office Provincetown, Massachusetts, оп January 31, 
Nero Pasi at 35 New Street, Worcester, Massachusetts. Copy- 
769, 
ae aged The bg eee to Shirley Bird Perry and Jack Steele of The Uni- 
verdi урен ош Union for their contributions to this study. . 
g“ ug this study were analyzed and reported differently in a thesis (1) 
ata us L 
supervised by the senior author. 
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eralizations tended to follow rather than to precede experiences. Activities 
during general sessions included brief discussions in small encounter groups, 
role playing; psychodramatic scenes, demonstration groups, and a variety of 
other verbal and nonverbal encounters. Thus the laboratory experience moved 
from general sessions to 'T Groups and back to general sessions to foster the 
personal development of individual members and to create a climate in which 
participants, hopefully, could discover freedom to explore relationships with 
others, their feelings about themselves, and patterns of their coping behavior, 
and could learn to give, receive, and use feedback constructively. 

'The rationale of the Semantic Differential (5) was used in developing an 
instrument consisting of 30 bipolar adjective pairs. The two adjectives in each 
pair were opposite poles of a seven-point rating scale. Each group member and 
leader used the 30 adjective scales (a) to rate the T-Group leader at the con- 
clusion of each of the six T-Group sessions and (b) to rate himself in the 
beginning and in the concluding general session of the laboratory. 

In scoring the instrument, a score of --7 represented a rating at the positive 
end of each adjective scale (e.g., warm) ; a score of +1 represented a rating 
at the negative end (e.g. cold). Ratings on the 30 scales were considered 
separately, as well as in sum, in the statistical design of the study. The anal- 
ysis of variance method with single classification and repeated measurements 
was used to test for statistical significance, except where similarity among 


members was of interest and a test for the significance of the difference between 
correlated variances was appropriate. 


C. RzsULTS AND DISCUSSION 


1. Overall, T-Group members and leaders did not rate themselves more 
similarly at the end compared to the beginning of the group experience. Mean 
absolute differences between self-ratings of each member and his leader at 
the beginning and again at the end of the laboratory were computed, and the 
overall means were compared by analysis of variance. 'The difference between 
the initial and the final mean sum of member-leader differences оп the 30 
adjective scales was not significant. (Initial and final mean sums of member- 
leader differences were 43.36 and 44.20 respectively.) 

'The finding that 'T-Group members and leaders did not become more alike 
suggests that members may not have been modeling themselves after the group 
leader, even though this is often regarded as membership tendency and as а 
correlate of change (6). Or, members may have already been sufficiently like 
their leader and, therefore, were unlikely to become more like him. It is con- 
ceivable that influences of the general sessions combined with rw in 
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the T Groups and in the laboratory overall may have reduced the need for 


member-leader modeling. 
2. Laboratory members ch 
session to session, thus suggesting t 
the group climate were continuously cha 
parison of mean sums of scores representing members’ ratings of the group 
leaders in the six T-Group sessions yielded an F ratio of 14.10 which, with 
5 and 215 degrees of freedom, was significant far beyond the .001 level. More- 
over, in the comparison of mean ratings on each of the 30 adjective scales, the 
change in members’ ratings of the group leaders over the six sessions was 
found to be significant beyond the .05 level on 25 of the 30 scales. 
a. Laboratory members saw the T-Group leaders in a more positive 
in the last T-Group meeting than they did in the first meeting. An 
indicated a highly significant difference be- 
tween the mean total scores representing members’ ratings of the group leader 
in the first and last meetings. (First and last session means were 143.36 and 
162.18 respectively ; р = 23.41; df = 1,43; 5 < 001.) Moreover, there were 
significant differences between the first and last session ratings on 19 of the 
30 adjective scales. Qualitatively speaking 1n the last, compared to the first 
T-Group meeting, members regarded the group leaders as warmer, deeper, 
and more open, reaching out, trusting, involved, sensitive, supporting, articu- 
late, understanding, caring, insightful, accepting, natural, authentic, forth- 
us, sharing, 4 


anged their ratings of the T-Group leaders from 
hat member perceptions and, by inference, 
nging. Àn analysis of variance com- 


manner 
analysis of variance comparison 
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eader in a significa 
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egarded the leaders in a significantly more negative manner than in the pre- 
eding session. "There was also a slightly negative mean shift from the fourth 
o fifth sessions, but statistical significance was not attained. The positive shifts 
n mean member ratings of the group leaders from the second to third, third 
to fourth, and fifth to sixth T-Group sessions were all statistically significant. 
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FIGURE 1 
PLor or MEAN TOTAL Scores REPRESENTING MEMBERS’ RATINGS OF 
+ _ THE GROUP LEADER FOR EACH 'T-GROUP SESSION 
Analysis of variance data on the significant comparisons between means are given 


below in the following order: i : i А 
with df = 1,43. Е sessions compared; respective means; F ratio; and $ 


and 2, 143.36 and 136.02, 4.01, 


1 

< 05. 
2 and 3, 136.02 and 142.98, 5.69, < .05. 
3 and 4, 142.98 and 154.25, 17.84, < .001. 
5 and 6, 153.14 and 162.18, 6.04, < .05. 
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Additional research is needed in which comparisons are made of member 
perceptions of group leaders from session to session. Comparison data should 


add to an understanding of the processes and movement of groups, since 


changes in member perceptions of group leaders could be related to patterns 


of leadership, as well as to phases through which groups move. 
3. The 44 laboratory members did not become more alike in the ways they 
saw the leaders as the laboratory continued : i.e., they did not vary less in their 
ratings of T-Group leaders in late compared to early sessions. Results of £ 
comparisons of variances in › ratings of the group leaders in the six 
sessions revealed that the 44 members did in fact vaty more—i.e, were less 
alike—in their ratings of the 'T-Group or the fifth and sixth 'T-Group 
sessions than for either the first oT the third session.* 
'The most obvious explanation of the increased dissimilarity in ratings of 
T-Group leaders by the 44 Jaboratory members might be the fact that four 
different leaders Were being rated. If the laboratory experience did achieve 
acy of participants, the increased varia- 
- creased accuracy among members of each T Group, 
i sed difference between members of 
different leaders. Thus 


the four different groups since 
it is conceivable t i e the 44 laboratory members did not become more 


alike in ratings of the four leaders, members within each separate T Group 
could nonetheless have become more alike in the ways they saw their group 


leader. 
ui rdance with the Burke and Bennis (2) con- 


Such a trend would be in accor’™ 
cept of perception cong According to this concept, group members are 


expected to reach grea reement in their perceptions during the course 
of a group experience. Perception congruity may s determined by two opera- 
tional measures: tween à mem er's perceptions of himself 
and other members’ perceptions of him, and (b) s a of consensus 
among group members! P fa particular member. The present ques- 
tion involves the $° +... the amount of consensus 1n members 
sut sts en и mbers within each of the four T Groups 


To determine then whe 


тиу. 


re given below in the following order: 


а comparisons a А E À 
the significant f SD," between ratings for the two sessions compared, 


25.47, 10, 2.87, < .01. 


^ Data оп 
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became more alike in their perceptions of the group leader, variances in mem- 
bers’ ratings of the leader in the first and in the third T-Group session were 
compared with variances in the fifth and in the sixth session in each of the 
four T Groups. These were the same comparisons on which the 44 laboratory 
members as a whole were found to be less alike in their perceptions of T-Group 
leaders in the later session. According to results obtained in the comparisons 
within the separate groups, members in none of the four T Groups became 
more alike in the ways they saw the group leader. Moreover, members in one 
group became less alike in their perceptions of the group leader.5 

Thus when members’ early and late perceptions of the leaders were com- 
pared, results indicated that neither the 44 laboratory members nor the mem- 
bers within each of the four Т Groups became more alike. No evidence was 
provided, therefore, to indicate that the laboratory experience led to greater 
consensus in members’ perceptions of the group leaders. 

4. Compared to ratings for the first session, members of T Groups re- 
garded the T-Group leaders more like the leaders regarded themselves in the 
last T-Group session. Absolute differences between members’ ratings of the 
leader and the leader’s ratings of himself were computed, and an analysis of 
variance comparison revealed that the mean sum of member-leader differences 
on the 30 adjective scales for the last session was significantly smaller than the 
first session mean difference. (First and last session mean sums of member- 


wee were 44.07 and 37.66 respectively; F = 9.57; df = 1,43; 
P «.01. 


themselves is an instance of the first measure of perception congruity: i.e., 
congruence between a person's perception of himself and others’ perceptions of 
him. Findings in this study support to some extent those of Burke and Bennis 
(2), even though different Statistics were used in the two studies, While the 
present study used analysis of variance comparisons of mean absolute differ- 
ences and ¢ comparisons of correlated variances, Burke and Bennis used the 
statistically more sophisticated D scores and factor analytic procedures along 
with the Wilcoxon test. Their results indicated that (a) the way members 


members regarded them became more similar; but that (b) late in the labora- 


5 Data on the significant ¢ comparisons in the one Sroup are given below as in 
Footnote 4, with p given for df = 9. 
1 and 6, 15.07 and 31.93, .66, 424, < o1, 
3 and 6, 15.46 and 31.93, .75, 4.70, = Oly 
3 and 5, 15.46 and 26.29, .75, 3.34, < .01. 
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tory, members varied as much as they had earlier in their perceptions of 
particular individuals on the Friendliness and Combination factors, analogous 
to the present finding that there was no difference between variances in mem- 
bers’ perceptions of the group leaders carly and late in the laboratory. 

It should be pointed out that although mean member ratings of the leaders 
were higher or more positive at the end compared to the beginning of the lab- 
oratory, this mean shift was not accompanied by a shift in the dispersion of 
individual members’ ratings about the mean. Similarly, the mean shift toward 


smaller member-leader differences in perceptions of the leader was not ac- 


companied by a shift toward smaller dispersion of members' ratings about 


their mean rating of the group leaders. 

5. Members changed their self-ratings in a positive direction from the be- 
end of the human relations laboratory experience. The differ- 
ence between the mean total scores for initial and final self-ratings—means = 
151.91 and 160.25 respectively—when tested for significance, yielded an F 
ratio of 9.76 which was significant beyond the .01 level (df = 1,43). This 
finding, along with the last two reported, should be regarded with caution ir. 
the absence of similar measures on a control group. Moreover, a possible halo 
affect, developing out of the increased involvement in the laboratory experience 
could influence responses in a positive direction. A replication, with the use of 
a control group, is needed to confirm this finding. 

As a group, the 44 participants in this laboratory appeared to be homo- 
geneous on several variables—age (most were 21), classification (mostly 


sophomores and juniors), grades (average of B or better), involvement in 
activities, and a preference for the collegiate subculture. In 


n T Groups, Stock (7) reported that homogeneous 


National Training Laboratory groups at Bethel were seen as having. less 
incentive to change, since member behavior was reinforced by a relatively 
stable culture. With regard to the present study, therefore, one question worth 
Would the changes that appeared to be associated with the 


e been greater if the participants had been more 
ariables? The members could have been 


quite different on other variables, such as emotional and personality patterns, 
which are relevant to change. In fact, the magnitude of the standard deviation 
of their initial self-ratings, 17.4, suggests that they were rather heterogeneous 
with respect to self-perceptions and possibly their self-concepts at the eis 
of the laboratory. The influences of homogeneity versus heterogeneity on group 
^R In conti Burke and Bennis (2) also fou 


9 In h ver, t 
contrast, however, ag a à 
to agree more With regard to extent of participation of partic 


ginning to the 


campus leadership 
a review of research o 


raising might be: 
laboratory experience hav 
heterogeneous on these demographic v 


nd that members did come 
ular individuals. 
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development, on incentives for behavior change, and on resistance to change, 
though not investigated in this study, represent an important area for T- 
Group research. Harrison (4) predicts greater learning from the conflicts 
which grow out of polarized subgroups within a heterogeneous group, in con- 
trast with a more homogeneous group in which conflict is less likely to occur. 

6. T-Group members changed their self-ratings on 11 of the 30 adjective 
scales. In their final, compared to initial, self-ratings, they rated themselves as 
significantly more warm, trusting, sensitive, supporting, deep, caring, au- 
thentic, relaxed, easy, natural, and dependent. (F ratios ranged from 4.76 


to 12.77; df = 1,43; р < .01 for six comparisons and р < .05 for five 
comparisons. ) 


On the first seven of these 11 scales, 


adjective initially and became more positive in final self-ratings. In contrast, 
on the next three scales—tense-relaxed, anxious-easy, and self-conscious-natural 
—the mean initial self-rating by T-Group members was toward the negative 
adjective (listed first) and shifted across the mid-point (4.0) toward the 
Positive adjective in final ratings. On one adjective scale—independent-depen- 
dent—T-Group members saw themselves as changing in the “negative” 
(dependent) direction, although the mean final self-rating was still toward 
the positive adjective. This shift could represent a positive movement toward 
the development of interdependent attitudes and behaviors. 

These positive adjectives, though limited by the design of the research instru- 
ment, describe to some extent the kinds of member behavior regarded as 
desirable outcomes of the group experience. At the same time the emphasis was 
not simply on achieving such specific behavior outcomes, Rather, the main focus 
was on the process by which members and leader experienced and learned from 
their encounters with each other, These adjectives also represent partial de- 


scriptions of the developing group climate or culture which is in process and 
never completely achieved, 


the mean was toward the positive 


7. The 44 laboratory members were not more alike in the ways they saw 
themselves at the end compared to the beginning of the group experience, ac- 
cording to the ¢ test of the difference between correlated variances. (SDs for 
initial and final total self-rating scores were 17.24 and 17.95 res 
71,2 = 49, 2 < 1.0.) Thus the mean positive change in members’ se 
was not accompanied by a change in the dispersion of individual 
ratings about the mean. 

Although the ¢ test revealed that members in the total sample did not vary 
less and, hence, did not become more alike in their overall self-ratings, the 
question was raised as to whether members of the same Т Group became more 


pectively, 
lf-ratings 
members' 


"Cm ow» 


ы 
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like one another during the course of the group experience. It is conceivable 
that pressures toward conformity, if they do emerge during the laboratory 
experience, might be observed empirically in the separate Т Groups, but not 
in the laboratory group overall. Consequently, the variances in members' ini- 
tial and final self-ratings were compared separately for each of the four T 
Groups. None of the four £ values approached statistical significance. Thus, 
as was true for the total sample of laboratory members, members of the 
same T Group were not more alike in the ways they saw themselves at the 
end compared to the beginning of the group experience. 

The element of heterogeneity suggested by this finding, though not anti- 
thetical to group cohesiveness, appears somewhat at odds with the theoretical 
notion that group members tend to become more conforming. Changes in 
group variability might be unexpected since, as pointed out earlier, the mem- 
bers appeared to be rather homogeneous demographicaly in several ways. 
Still, homogeneity on demographic characteristics did not necessarily restrict 
heterogeneity with respect to emotionality and personality patterns. The 
magnitude of the two standard deviations above indicates that there was 
in fact considerable variability in the self-images of members at the outset and 
that it was possible for them to become more similar, as would be shown if 


there were less variability at the end of the laboratory. 


D. SUMMARY 
y 44 university students and by their leaders in 
four Т Groups were analyzed for information on how members of a human 


relations laboratory change in their perceptions of their group leader and of 
themselves. To measure change in members’ perceptions of leader and self, 
data obtained early and late in the laboratory were compared. The statistical 
data included (а) mean number ratings of leader and of self, (b) variances 
in members’ ratings of leader and of self, and (c) mean absolute differences 
between each member and his leader in ratings of self and of leader. The 
statistical methods included the analysis of variance with single classification 
and repeated measurements, and the ż test of the difference between correlated 


variances. . : 
Contrary to expectations, 'T-Group members’ self-ratings did not become 
Jeaders at the end of the group experience. Anal- 


more like self-ratings of their 
however, indicated (a) that members saw 


ysis of members’ ratings of leaders, s 
leaders more positively in the last T-Group session than in the first session, 


but (0) that the shifts in members’ perceptions of the leaders from session to 
ly positive. 


Bipolar adjective ratings b; 


session were not continuous 
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The 44 members as a whole and the members in the separate Т Groups did 
not become more alike in their perceptions of the group leaders at the end, 
compared to the beginning, of the group experience. It was found, however, 
that members came to perceive the leaders more like the leaders perceived 
themselves. 

T-Group members saw themselves more positively at the end than at the 
beginning of the laboratory experience. In essence, they regarded themselves 
as being more authentic and natural; more sensitive and supporting; and more 
able to trust and to care for others and themselves. "The significant positive 
change in the mean of members’ self-ratings, however, was not accompanied 
by a tendency for group members to become more similar in their self-ratings. 
"There was considerable variability in the self-perceptions of members at the 
outset of the laboratory experience. Even though it was possible, then, for 
members to become more similar during the experience, comparison of vari- 
ances in their initial and final self-ratings—both for the 44 laboratory members 
as а whole and for members of each of the four Т Groups—yielded no sig- 
nificant difference, "Thus, parallel to the finding that the 44 members as a 
whole and the members of separate T Groups did not become more alike in 


their perceptions of the group leader was the finding that these members did 
not become more alike in their ratings of self. 
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THE RELATIONSHIP OF ILLNESS AND HEALTH BEHAVIOR 
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А. INTRODUCTION 


The University of California at Los Angeles (UCLA) Peace Corps study 
began as a study of the effectiveness of health training. The usual paper-and- 
iken before and after health training; and changes in 
ard illnesses were assessed (6). 
pated before-after changes in the paper-and- 
a problem of greater significance 
lem of actual behavior change which would reduce morbidity. 
ing to countries with far different health prob- 

health practices if they are to avoid unfamiliar 


pencil tests were t: 
knowledge and attitudes tow 

The investigators had antici 
pencil tests. "They believed, however, that 
was the prob 
Peace Corps volunteers mov 
lems must adopt radical new 


and unpleasant illnesses. 
'The problems involved in exploring behavior changes and morbidity re- 


duction in this study were numerous. The major ones follow: 
(а) In the few studies where attitudes and behavior have been jointly 
studied, no very strong relationships have been found (3). Perhaps the lack of 
attitudes and behavior arises from our inability 


sociations between 
vating attitudes. (The student dislikes examinations but 
favorable attitude toward staying in school.) 
ant attitudes can be measured, the appro- 


priate behaviors cannot be. In controlled studies, limited single-event behaviors 

may be measured, but complex ones cannot. Even the simple behavior must be 
s » . . m 

1 be measured in any practical sense. А visit to 


narrowly defined before it can be Hm д 86s А 
а college health clinic can be verified, but a visit to a private physician (during 


significant as: 

to measure the moti 

he takes them because he has а 
(b) If it is assumed that the relev: 
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the summer, for example, when large numbers of students move to large 
numbers of residences) cannot be verified. Trained observers cannot agree 
upon the beginning and ending of a complex behavioral interaction. Even if 
behavioral measurement were possible, it would not be practical to attempt 
measurement of behaviors overseas for a two-year span for several hundred 
Peace Corps volunteers. 

(c) The measurement of morbidity is often imprecise.? Self-report has 
consistently produced underestimates of at least some kinds of illness LT. 3. 
Physician visits, even where they can be determined, reflect only those ill- 
nesses for which the individual believes he needs or wants treatment. Data 
from the present study demonstrate individual differences in the extent to 
which symptoms are judged to warrant a physician visit. 

(4) Health behavior and morbidity measures may be the same or they may 
not. A physician visit may be an index of illness or morbidity; but the con- 
scientious Peace Corps volunteer may make physician visits to preserve good 
health or to insure early diagnosis and treatment, so that physician visits be- 
come a measure of good health behavior. 

With full awareness of these difficulties, the investigators proceeded with 
the assumption that useful data could be collected to begin the task of relating 
illness and health behavior to paper-and-pencil tests of attitudes and informa- 
tion. The measure of illness and health behavior used was physician reports, 
which are filled out when a volunteer visits him. It was assumed and later 
verified that many illnesses among the volunteers were closely related to 
health behavior. "That relationship suggested morbidity as a negative measure 
of behavior. In the morbidity reports prepared by Peace Corps physicians, the 
incidence of amebiasis, dysentery, and other gastro-enteric diseases was sizable 
and was exceeded only by colds or flu. This high incidence is indication of a 
violation of recommended practices in preparation and ingestion of food and 
drink, As an extreme but clear example arguing for the use of morbidity 
measures as indices of health behavior, consider the incidence of venereal 
disease: the absence of illness does not tell us that certain behaviors did not 
take place, but the presence of the illness tells us that it did. 

Given crude morbidity measures and a battery of psychological tests, the 


? Initially it was felt that valid and reliable morbidity indices would be obtained 
by examination of stool specimens and of sera, Unfortunately, analysis of the first 
set of stool specimens produced such a low frequency of positive laboratory results 
that these data were not analyzed. Similarly, of the first 63 serum samples analyzed 
by the Communicable Disease Center (CDC), Atlanta, Georgia, only one showed a 
disease which might have been contracted in Africa; so that CDC does not plan to 
analyze the remaining specimens, 


— 
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investigators offer some hypotheses relating health behavior and illnesses to 
attitudes, and suggest more or less promising avenues for further investigations. 


B. METHOD AND PROCEDURE 


Peace Corps volunteers trained at a single university and headed for a 
single African country were given a battery of tests at the beginning of 
training (Exam 1), at the end of training (Exam I1), after six to eight months 
in the field (Exam III), and at the end of two-year service (Exam IV). 
Peace Corps physicians’ Monthly Morbidity Reports were examined for all 


volunteers. Step-wise discriminant functions were used to relate the paper-and- 


pencil tests to the crude morbidity measures. 

Before and after training, a health information test of 24 multiple-choice 
items and 10 matching questions was given. Most of the items had been 
previously used in Peace Corps health training courses. At the end of the two- 
year service period 10 relatively common host country illnesses were listed 


and the volunteer was asked the signs and symptoms of each illness. 


A previously constructed form of the semantic differential (5) was used 


before and after training and in the field, to assess two major dimensions of 
judgment toward illnesses. The first dimension, “seriousness or severity,” is 
attitudinal in nature, and the second major dimension measures belief about 
the “avoidability” of illnesses. The initial tests (before and after training) 
measured these two dimensions for 25 selected illnesses; the field test was 
shortened to 11 representative illnesses. 

A special form of the semantic differential was constructed and pretested 
for use in the present study to indicate the likelihood with which volunteers 
expect to follow major recommended health related behaviors. The scale 
was not developed as a general measure of attitude toward arie asa 
general description of the means by which behaviors pi proie ins i Tf was 
constructed solely to determine the expressed likelihood of d al 
practices. Twenty-seven behaviors Were included ipn tests given during the 
training period; 12 of the 27 were included in the field tests. T 

At the end of training, а COUISe reaction sheet of the traditiona ee 
type was administered ; for example, уо s asked if they found the 
со i ative, clear, interesting, and so torth. . 

pecu Monthly Morbidity Reports from the sape ue с 
number of physician visits, number of hospitalizations, number ot 1 nesses with 


a psychological component, and number of days lost from work. While it is 


3 Computation of scale scores is explained by Clark and Kerrick (2). 
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possible that some volunteers received treatment from non-Peace Corps physi- 
cians, those visits must be reported to the Peace Corps before bills would be 


paid. Thus, it is likely that the records include the majority of physician visits 
during the service period. 


C. RESULTS 


For (a) the total number of physician visits, (5) the days which the 
volunteer missed work because of illness, (c) the days the volunteer was 
hospitalized, and (4) the presence or absence of an illness with any psycho- 
logical component, two sets of step-wise discriminate functions were per- 
formed for each index. Essentially, the questions asked were these: first, can 
the number of physician visits, days missed work, days hospitalized, or psycho- 
logical illness be predicted from the paper-and-pencil tests given the volunteers 
before and after training; and second, in view of the changes in attitudes 
and expected behaviors over field-service period, can better prediction of these 


morbidity variables be achieved by using all paper-and-pencil tests including 
those taken overseas? 


1. Prediction of the Number of Physician Visits 


Table 1 gives the best prediction achieved for extreme groups of volunteers 
in regard to physician visits, the variables for prediction, and the F level or 
significance of that prediction. With only tests administered in this country 
before the volunteer left for overseas service, three tests give us the best pre- 
diction of whether or not a volunteer will have a large number of physician 
visits. Those with the most visits showed a higher personal health likelihood 
score on Exam I (pretraining), they found illnesses more avoidable after 
training, and they also showed a higher personal health likelihood score on 
Exam II (post training) than did those with fewer visits. 

The proportion of correct placement into one of the two groups was not 


TABLE 1 
PHYSICIAN VISITS PREDICTED BY THREE TESTS AND PH 


YSICIAN  VistTS REPORTED 
(Exams I and II only) 


Number visits Number visits predicteda d d ERE 
reported 0-3 4+4 ый 
"n 10 ze 116 aes 
x 16 29 19 
Total 86 75 ia 
z Note: Percent correctly predicted: 61.5; F = 2.10; df = 3,157; pc Com M" 
а Predictive tests: (a) Likelihood, Personal Health, 


ikeli Pretraining; (b) Avoidability 
of Illness, Posttraining; (c) Likelihood, Personal Health, Posttraining. ; 
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high. Roughly 61 percent of the volunteers were correctly placed in one of the 
two groups, with about 30 percent false negatives, and about 40 percent false 
positives. "Thus, close to 40 percent of those whom the authors felt would 
have any physician visits, actually did not, and about 40 percent of those 
whom the authors felt would have few physician visits, actually had many; 
yet the Ё ratio approaches significance. 

When all tests are considered, no improved prediction is achieved. When 


the grossness of the measures and the time period are considered, it is sur- 
ound between tests given in this country, 


prising that any relationship at all is f 
It is interesting 


and physician visits during the two-year service visits overseas. 
to note, however, that speculations made from summer observers’ reports ap- 
pear justified. Physician visits should not be considered as a measure of 
morbidity alone, but should be considered partially as a measure of good health 
The volunteer, who, in this country, says that he is most likely to 


behavior. 
nal health behaviors also appears most likely to visit a physician 


follow persor 


when he is ill. 
Further evidence of this confounding of morbidity with good health be- 


havior is obtained by a closer examination of the scores on likelihood and 
health behavior. This scale tends to be related in curvilinear, rather than 


linear fashion to each of the morbidity measures. Thus, if volunteers are 
separated into three groups rather than two (those with no physician visits, 
an average number of ph physician visits), 


ysician visits, and a great many 
volunteers are found who express à low likelihood of following recommended 
behaviors in both extremes. The v 


olunteer unlikely to follow recommended 
personal behaviors tends either not to visit a physician or has so many illnesses 
that he shows many physician visits. 


(It should be noted that the discriminant 
function is a linear function. The pronounced curvilinearity found especially 
ases the ability oft 


in the likelihood scale decre: he investigators to p "d 
quately the number of physician visits. It 1$ possible that, if number e : 
of physician visits were recorded and analyzed separately according to the P 


sumed behavioral cause, better prediction would result.) 


2. The Prediction of the Number of W orkdays Missed 
n volunteers reported 
t section deals w 


missing work because of illness, 
ith the small grouP of 


By and large, whe 
ess when they were not 


they were hospitalized. The presen 
Volunteers who reported missing work because of illn 
hospitalized. 

Four variables, extracte 
Ч to predict correctly 67 
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on analysts, 
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Р „wise discriminant functi j 
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a little over 30 percent false positives (Table 2). Again, this prediction is 
based initially on tests given at the training sites. Those variables entering 
into the prediction were posttraining avoidability scores (those who missed 
work felt illnesses were much more avoidable) the second and first knowledge 
tests (where volunteers who reported missing work initially scored lower on 


TABLE 2 
Days Missep Work PrepicTep BY Four Tests AND Days REPORTED 
(Exams I and II only) 


Reported days Predicted days misseda 
missed None Some Total 
None 98 49 147 
Some 4 10 14 
Total 102 59 161 


Note: Percent correctly predicted: 67.1; Е = 2.25; df = 4,156; р < .10. 
а Predictive tests: (а) Avoidability of Illness, Posttraining; (b) Knowledge, Post- 
training; (c) Knowledge, Pretraining; (d) Course Reaction. 


the knowledge test, but after training scored higher on the knowledge test) 
and finally, Guttman scores derived from judgments of the personal health 
section of the health training (those missing work expressed greater dislike 
for the health training section). Again, the F ratio approaches significance, 
falling beyond the .10 level. 

With the use of all paper tests, prediction of those who will report missing 
work because of illness is not greatly increased, but the same level of pre- 
diction can be reached by using only three variables. The first variable in 
prediction remains the judgment of the avoidability of illness on Exam II 
(posttraining) ; the second variable indicates that at the end of training those 
who reported missing work also reported a much higher likelihood of following 
recommended personal health behaviors, and a somewhat lower knowledge 
of frequent symptoms of illnesses at the end of service knowledge test. It 
would appear that responses to the end of service questions about the likeli- 
hood of following personal health may well be the results o£ experience with 
illness, rather than a causative factor in determining who will be ill enough 
to miss work. Those volunteers who, during their overseas service, h 
forced to miss work because of illness, even though that illness was not severe 
enough to cause hospitalization, are perhaps better able to appreciate the neces- 
sity for following recommended health-related behaviors, On the other hand, 
these volunteers know fewer of the symptoms which accompany the illnesses 
in the host country (4). 

After a review is made of the training test scores which predict the days 


ave been 
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missed work in relationship to those scores which predict physician visits, it 
is interesting to note that the single scale in common with the two is the oid- 
ability score on Test II (posttraining), where both those with a higher number 
of physician visits and those who miss work see illness as more avoidable than 
a group with an average number of physician visits and the group who failed 
to report days missing work because of illness. It may be argued, of course, 
more volunteers missed work because of illness than actually re- 
seem likely that some volunteers did miss some work 
impossible from our data, however, to assess the degree 


of underreporting in this area. o say, however, that those who 
reported missing work because of illness actually did miss a full day of work 
on each occasion that they reported it. In this respect again, ability to predict 
may be limited by the accuracy of reporting, and this perhaps reflected in the 
of false positives than in the number of false negatives. It is 
at some of those whom the authors predicted would miss 
ork because of illness but simply failed to report it. 


that many 
ported it; and it would 


and fail to report it. It is 
It seems safe t 


greater number 

quite possible th 

work actually did miss W 
3. The Prediction of Days Hospitalized 

significance at the .10 level (Table 3) for the 

prediction of days hospitalized. In the step-wise analysis, adding any variable 


past the first simply increases the false negatives. Although this analysis did not 
F level even approaching significance, it is instructive to look at 


The F ratio failed to reach 


produce an 


the variables extracted. | 
nteers left this country, the first three 


Of those tests taken before the volu left 
variables extracted by analysis are Exam I, likelihood of personal health be- 


havior; Exam TL. avoidability ; and Exam 11, likelihood of personal behavior. 
Those who had exte hospitalization showed a much lower likelihood of 


nsive н 
following recommen health behaviors than did those without 


ded personal 


TABLE A Е 
ICTED BY ONE VARIABLE AND HOSE REPORTED 
Days HOSPITALIZED Pe, ата Т and II only) 


Hospitalization predicted 


Doc MM ®т m 
7 57 
76 133 
Some п 16 27 
Total 87 z 150 
7 F = 180; df = 1148; P > 10 


ERES 
em icted : 61.3 . 
уйк Percent m pred {hoo E Intended Health Behavior, Pretraining; (5) 
At eredietive tines posttraining: (c) Likelihood of Intended Health Behavior, 
Posttraining. 
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extensive hospitalization for both Exam I and Exam II. Those with extensive 
hospitalizations did report increased likelihood of following recommended 
personal health behaviors after training than before, but at both pre- and post- 
training tests they were well below the average likelihood for the group which 
did not have extensive hospitalization. Also, although the difference was small, 
those with hospitalization had a slight tendency to find illnesses more avoid- 
able. 

Adding all tests, including those taken in the host country, fails to provide 
any improved prediction of those who will have extensive hospitalization. The 
same three tests—Exam I, personal health; Exam П, avoidability ; and Exam 
II, personal health—are extracted by analysis. Examination of results of the 
full set of tests, however, suggests that extensive hospitalization does produce 
change itself. "Thus, the group with extensive hospitalization reports a slightly 
higher likelihood of following recommended personal health behaviors at 
Exam period IV and at the end of training. 

It is interesting to see that the number of physician visits and the days 
hospitalized are differentiated by the same tests. For both morbidity measures, 
the first tests found important were extracted in the same order. In general 
then, it might be said that those volunteers who express a high likelihood of 
following recommended personal health behaviors on both Exams I and II 
will show a higher number of physician visits but a lower number hospitaliza- 
tions. (Again, the same curvilinearity of the personal health behavior scale 
probably decreases ability to predict correctly the days hospitalized. However, 
the prediction rate remains at about 60 percent correctly predicted.) 


4. Prediction of Volunteers with Any Illness Having 
a Psychological Component 


It must be admitted that this scoring category is probably the least reliable. 
The rating of the psychological component of illness is dependent on physician 
judgment; it is, therefore, susceptible to interphysician variability, N 
less, with the realization that the work was conducted with very gross mea- 
sures of morbidity, tests of those 57 volunteers who showed any illness which 
was rated as having a psychological component on the physicians’ Monthly 
Morbidity Report were examined statistically. With reference to only those 
tests administered in this country, three variables predict with almost 60 
percent accuracy, the F ratio falling just short of .10 (Table 4). Adding all 
exams fails to increase predictability appreciably, although it does decrease 
the number of false positives: that is, those who are predicted to have illnesses 
with psychological components, but who indeed do not. When analysis is 


onethe- 
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TABLE 4 
NuMBER OF VOLUNTEERS HAVING AN ILLNESS WITH ANY PsYCHOLOGICAL COMPONENT, 
PREDICTED BY THREE VARIABLES 

(Exams I and II only) 


Reported illnesses Predicted illnesses with psychological 


with psychological component? 
component None Some Total 
None 77 61 138 
Some 28 Li 72 
"Total 105 105 210b 


Е Note: Percent correctly predicted: 57.6; F = 2.03; df = 3,206; p > .10. 
а Predictive tests: (а) Severity of Illness, Pretraining; (b) [Knowledge, Post- 


training (not used in prediction) ]; (с) [Knowledge, Pretraining (not used in pre- 


diction) ]. 


b Number includes volunteers who left before the end of service. 


hen analysis is extended to all tests, 


limited to tests given in this country or w 
ses depends primarily 


it is found that prediction of psychologically related illnes: 
upon attitudes toward illness and upon knowledge. 

"With reference to tests given in this country, those who eventually have 
illnesses with psychological components find illnesses more severe than do 
subjects not having illnesses with psychological components. They know 
slightly more about illness at the beginning of training and somewhat less at 
the end, as measured by knowledge tests I and II. Additionally, they find ill- 
nesses more avoidable at the beginning of training. When all tests are con- 
sidered, the most important difference between the two groups is in the 
knowledge test given at the end of training; the group with any psychological 
component to their illnesses knows more illness symptoms than does the group 
with no illnesses having psychological components. While the group having 
illness with a psychological component originally found illness more avoidable, 
at the end of service examination they found it less avoidable than did those 
without similar psychological correlatives to illness. 

Of the four morbidity indices used, two of them—the days of work missed 
due to illness and the presence of an illness with a psychological component 
—appear to be related to judgments of illness and to knowledge. The other 
two—the number of physician visits and extensive hospitalization—appear 
more strongly related to the expectation of following recommended personal 


health behaviors. 
D. SUMMARY AND DISCUSSION 


ken in this country before Peace Corps volunteers 


-pencil tests ta i 
d in Africa were related to four relatively crude 
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left for a two-year service 
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measures of morbidity: (a) number of physician visits, (2) days missed work 
because of illness (without hospitalization), (c) hospitalization, and (4) ill- 
ness rated by the Peace Corps physician as having a psychological component. 

Attitude toward illness (or judgment of the severity of the illnesses common 
in the host country) was related only to the opinion of the physician that the 
volunteer had some illness with a psychological component. А very tentative 
suggestion may be made that attitude toward illness may be predictive of 
psychosomatic illness or to somatizing of work or living pressures rather than 
to more definite physical illness. 

Belief in the avoidability of illness, however, is related to all of the other 
three measures of morbidity. The scale of likelihood of following recommended 
personal health behaviors is related both to the number of physician visits and 
the hospitalization for illness, in conjunction with judgments of avoidability. 
The relationships are not simple ones, and the present data indicate a need for 
greater refinement and precision in the measurement of morbidity and in the 
paper-and-pencil tests used. 

Because so many variables other than attitudes, information, and intended 
health behavior may be important ; because of the gross categorization of illness 
and morbidity; and because of the two-year time span, it is surprising indeed 
to find that the tests related to the authors’ morbidity measures and gave some 
indications of the volunteer’s health-related behavior in the field. 

Health behavior, itself, is an intervening variable in only some instances, 
and illness will result only occasionally when poor health practices are fol- 
lowed. Nonetheless, the number of physician visits (very few vs. many) and 
the days hospitalized because of illness (none or many) are highly related 
to scores of the personal health behavior tests, especially the tests given prior 
to service abroad. These same two gross morbidity measures are related to 
judgments of the avoidability of illness which the volunteer makes immediately 
following training. 

Future investigations might well focus upon more refined measurements, 
and upon the complex interaction between health behavior, illness experience, 
and subsequent health behavior. я 

Implicit in the present data is the proposition that some people learn from 
training or course work, resolve to follow recommended practice, and, in 
general, remain healthy. Others, conversely, who felt themselves ill enough 
to miss work (although not so ill that they required hospitalization) learned 
little of the signs and symptoms of illness, but ended service with greater verbal 
resolution to follow recommended practices. For this latter group, we might 
well ask whether there is some way to teach them to recognize real illness, 
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and some relatively easy way to convince them to take reasonable precautions 


to preserve their health. 


1. 


2. 


3. 


4. 
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A NOTE ON THE USE OF DISCREPANCY SCORES IN 
A SELF-CONCEPT INVENTORY* 


Department of Psychology, University of Manitoba 


SHIRIN SCHLUDERMANN AND EDUARD SCHLUDERMANN 


Factor analyses of self- and ideal-ratings on Worchel’s self-concept inventory 
(2) failed to yield meaningful unidimensions for the replicated sample of 
college boys and girls (1). It was considered necessary, therefore, as the next 
logical step, to examine carefully the discrepancy scores between the self- 
and ideal-ratings with a view to obtain basic dimensions which would account 
for maximum variance in discrepancy scores. Since the discrepancy scores are 
taken as indices of adjustment or maladjustment, it is all the more important 
to clarify what is involved in the discrepancy measures. Discrepancy scores 
were obtained on each item between the self- and ideal-scores for each S. Inter- 
correlations among discrepancy scores on 54 X 54 items were obtained for 
boys and girls separately. Two sets of factor matrices were obtained from two 
sets of correlation matrices. Factoring was done by the principal axes method 
and rotated orthogonally by the varimax solution. The first factor accounted 
for 16 and 12 percent of variance respectively, in boys’ and girls’ factor 
matrices. A total of seven factors in the boys’ case and eight factors in the girls’ 
case accounted for approximately 40 percent of the total variance. As is clear, 


there are no common variances; hence the discrepancy scores consist of a large 


number of specific factors. | . | 
'To obtain the relative contributions of the ideal-ratings and self-ratings 


to the discrepancy scores, product-moment-correlation coefficients were com- 
puted for boys and girls separately (see Table 1). 

The results agree with Wylie's statements (3, p. 105) that self-scores con- 
tribute most of the variances to the discrepancy scores. However, there are 
interesting. differences between the sex groups; the correlation between self- 
and ideal-scores is near zero for girls, whereas the correlation for boys is .33. 
In the girls’ case, the correlation between self vs. discrepancy scores was found 
to be significantly higher than the correlation between ideal vs. discrepancy 
scores. In the boys' case, the correlation of .62 was not found to be significantly 
different from a correlation of 46. The analyses reported here indicate that 
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TABLE 1 
PnRopucr-MOMENT-CORRELATION COEFFICIENTS BETWEEN IDEAL SCORES, SELF-SCORES, 
AND DISCREPANCY BETWEEN IDEAL- AND SELF-SCORES 


Replicated Normative 
sample sample 
Boys Girls Boys 
Scores (N = 177) (N — 150) (N — 153) 
Ideal ws. Self 33 .05 49 
Ideal vs. Discrepancy A6* .30* 
Self vs. Discrepancy .62* .60* 


* Significant at .01 level. 


Worchel's self-concept inventory is neither a unidimensional nor a multi- 
dimensional scale. However, the overall composite scores on self and ideal and 
the discrepancy scores may be positively correlated to other tests. 
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PREDICTION OF SUPERVISORY SUCCESS BY USE OF 
STANDARD PSYCHOLOGICAL TESTS* 


Department of Psychology, Muskingum College 


Н. Epwin Titus! 


А. INTRODUCTION 


Widespread use of psychological tests for personnel appraisals in selection, 
placement, and promotion considerations is a fact of life in industry today. 
Barrett (1) has pointed out that appraisal of the man without specification of 
his work situation is a “patent medicine” approach and is held in low esteem 
“for the simple reason that it is unlikely to work” (1, p. 141). Test-based 

rays some of which are legitimate and so- 


appraisals are used in a variety of w в 50 1 ; d sc 
phisticated. Sadly, however, many applications are highly questionable. Within 


the limitation of a clear definition of a specific situation, both the statistical 
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showing the patterns of relationship of a substantial group of predictors against 
a diversified set of criteria in a common setting with all data based on a homo- 
geneous sample of line supervisors. Such data can provide a starting point for 
examining measures that may be considered in specific local studies of per- 
sonal factors in supervision. 


B. METHOD 


Аз a part of a larger investigation, nine tests vielding 38 scores were ad- 
ministered to 67 line supervisors at a large U.S. Government supply depot. 
Participation in the study was voluntary, but strongly encouraged by higher 
management. Those who participated were given time off from their regular 
duties and were tested in special testing sessions which took place during 
regular working hours. Participants were assured that the study was purely 
for research purposes and not a basis for administrative action. The vast 
majority of the middle-level supervisors participated; however, for various 
reasons it was not possible to obtain complete data on a few of those who were 
willing to participate. The 67 subjects mentioned above included only those 
for whom the data were complete in all categories. 

The battery of standard psychological tests included The American Council 
on Education Psychological Examination (ACE), two variables; The Cali- 
fornia F Scale; The Empathy Test; The Hidden Figures ‘Test; The Guil- 
ford-Zimmerman Temperament Survey (G-Z), 10 variables; How Super- 
vise?; The Sixteen Personality Factor Questionnaire (16 PF), 16 variables ; 
The George Washington University Social Intelligence Test (SIT), five 
variables; and The Watson-Glazer Critical Thinking Appraisal (W-G). 

The criterion measures were chosen on the basis of the availability of 
information and the interests of higher management. Since any modern con- 
sideration of industrial validity of psychological tests must recognize that the 
day of seeking the "ultimate criterion" is long past (cf., 2), it was considered 
essential to have as broad a spectrum of criterion measures as possible, "The 
criterion measures included four morale variables, six measures of the per- 
formance of each supervisor's work group,? and a proficiency rating for each 
supervisor provided by their superiors. The specific variables were as follows: 
morale concerning working conditions and environment (M-1), morale con- 
cerning financial reward (M-2), morale concerning supervision (M-3), mo- 
rale concerning management and administration (M-4), average number of 
accidents (P-1), average number of suggestions submitted (P-2), average 


dp 
з [n a service facility like the present research setting, production measures af 
performance in the conventional sense were, of course, not possible. 
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number of awards received for suggestions (P-3), average number of subordi- 
nates receiving awards for superior performance (P-4), average number of 
hours lost due to sick leave (Р-5), average number of hours lost due to unau- 
thorized absence (P-6), and proficiency rating of supervisors (R-1). 


C. RESULTS 


A list of the tests and the variables which they purport to measure is given 
in Table 1 along with a code designation, mean, and standard deviation for 


TABLE 1 
Cope DESIGNATIONS, MEANS, AND STANDARD DEVIATIONS OF THE 'TEST VARIABLES 
Standard 

Code Test Variable name Mean deviation 
= Linguistic Ability 41.64 15.57 
ЩТ pee Quantitative Ability 20.14 11.08 
т4 F Scale Authoritarianism 134.61 23.65 
"at Empathy Mass Empathy 72.30 19.14 
T-5 HFT Percep. Reorg. 5.18 2.83 
T- G- Activity 16.33 5.37 
ce G-Z Restraint 17.22 4.57 
у GZ Ascendance 15.12 472 
T-8 tZ Social Interest 18.20 574 
3 rte Emotional Stability 18.93 4.76 
T-10 67 Objectivity 17.16 5.22 
Т-11 G-Z Friendliness 14.79 5.85 
T-12 G-Z Reflectiveness 18.78 4.19 
1-13 G-Z Cooperativeness 17.54 5.08 
T GZ Masculinity 17.88 421 
T-15 Ho ‚ Sup? Supervisory Knowl. +142 11.11 
1-16 ӨРЕ o Cyclothymia 8.36 2.72 
Т-17 J PF Intelligence 6.01 1.92 
T-18 16 РЕ Ego Strength 15.08 2.97 
T-19 16 UE Dominance 12.50 3.22 
T-20 16 F Surgency 12.71 3.16 
Т-21 16 E Super-Ego Strength 13.08 2.62 
T-22 16 E Parmia 11.61 E 
r^ 16 PF Premsia i 3.35 
T-24 16 PE Protension 8.38 3 
T-25 16 BE дона 11.07 27 
T-26 16 P Shrewdness 1142 2.72 
1-27 16 PF Guilt Proneness 10.62 3.68 
T-28 16 PF Radicalism 8.62 2.28 
T-29 16 ВЕ Gelf-Sufficiency 8.96 3.10 
T-30 16 PF High-Self Sentiment 1141 2.58 
T-31 16 PF High Ergic Tension 11.99 4.02 
T-32 16 PF Social Judgment 19.24 3.63 
T-33 SIT Rec. Ment. State 23.70 6.74 
A SLT Obser. Hum. Beh. 35.29 5.86 
T-35 SIT Mem. Names/Faces 947 6.37 
T-36 SIT Sense of Humor 10.83 4.18 
T-37 ББ Critical Thinking 51.12 10.23 
T-38 Wi 
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TABLE 2 
Supervisors’ TEST SCORES CORRELATED WITH MEASURES OF SUBORDINATES’ MORALE 
AND PERFORMANCE AND SUPERIORS’ RATINGS OF SUPERVISORS 


Variable М-1 M-2 M-3 M-4 P-1 Р-2 P-3 P-+  P-5 P-6 R-1 


T-1 02 18 30 14 12 34 27 1+ 06 07 09 
T-2 06 —06 06 —07 09 01 07 02 00 05 20 
T-3 15 —03 —02 —10 —08 —09 —20 11 —21 07 16 
T-4 04 07 21 26 —04 21 10 17 03 08 02 
T-5 06 —02 —0+ —16 04 —08 —10 —09 —06 —04 —05 
T-6 —06 —14 09 01 06  —09 —11 —18 12 —06 11 
T-7 01 —07 —03 00 03 00 08 —10 —09 05 10 
T-8 —28 —29 —03 —24 12 —10 —21 01 01 02 —06 
T-9  —21 —26 —08 —12 05 —28 —44 —08 —02 09 00 


T-10 —06 —17 —02 —12 —07 05 —11 12 13 —06 03 
T-11 —10 —13 —03 —06 —06 —01 —15 04 05 о п 
1-12 —06 05 —05 03 —13 01 —13 04 —01 09 08 
T-13 —02 —12 02 —04 05 03 03 14 —09 —08 06 
т-м 04 07 10 17 —09 07 —08 07 01 05 26 
Г-15 —03 —06 —08 —02 07 20 —06 12 21 —27 25 
T-16 02 —0 п 14 0 19 —0 23 п 05 0 
Г-17 —13 —21 25 —04 —06 —24 —11 —18 —07 —03 03 
T-18 21 09 23 14 10 12 18 —01 —01 08 11 
T-19 10 —05 —12 —03 03 05 —14 05 —11 —10 0+ 
T-20 —06 —09 04 —10 01 —07 —30 20 —02 —1 —11 
T-21 05 01 17 01 10 —01 —18 14 10 03 —23 
T-22 14 06 —15 03 00 —02 —12 —01 —03 14 — 0l 
T-23 —08 —09 13 —09 03 —22 —31 —11 08 02 —15 
T-24 —05 —05 22 14 —03 —08 21 —14 —04 —06 02 
T-25 —16 08 12 о 06 00 20 —01 00 —05 —13 
16 —13 —07 —05 —14 05 —19 —05 —12 —05 —o1 —08 
aar v 17 —25 06 —21 —03 08 01 08 00 —16 
28 y 23 13 17 —07 05 06 21 —10 01 —23 
а — —15 —35 —09 —04 —17 —16 —07 14 01 02 

0 —08 07 —10 09 07 24 25 08 05 11 00 
T-31 —11 —20 —31 —34 09 —02 —09 —17 04 —08 06 
T-32 00 12 14 10 16 20 27 16 —03 —02 —22 
T-33 —06 19 25 95 93 17 25 03 —07 —11 —02 
T-34 —04 13 23 10 02 20 22 оз 01 11 00 
1-35 —6 09 14 06 o6 09 19 06 —09 —04 —16 
T-36 10 06 —03 08 05 —02 06 —08 —22 09 12 
T-37 00 21 17 06 12 30 25 16 02 05 05 
T-38 —01 —02 12 05 14 04 13 —13 —12 —02 06 


Note: For simplicity of presentation deci i i 
_No ecimals are omitted. r's of .24 or greater are 
significant at the .05 level. 7’s of .31 or greater are significant at the .01 level. 


each. "Table 2 presents the correlations between the test variables and each 
of the criterion variables. Included are 27 statistically significant rel 


к ч f ationships 
involving 16 test variables and eight criterion measures. 


D. Discussion 


The absence of spectacular relationships between any of the predictors and 
any of the criteria suggests strongly that caution must be exercised in the 


1 


4 


H. EDWIN TITUS 39 


rd tests to broad evaluations of individuals 
mination of Table 2 shows that many of 


the seemingly relevant test variables showed only negligible relationships with 
measures intended to define some aspects of supervisory success. Since the 
criterion variable R-1 comes closest to the conventional criterion measure in 
this area, it is particularly instructive to trace the last column of Table 2. 
The many near-zero correlations are an indictment against a simple clinical 
interpretation of the scores. 

The tests used in this study showed some useful and meaningful relation- 
ships with criterion measures which, if carefully organized and weighted, could 
provide quite satisfactory prediction of some of the criteria. For example, five 
test variables were significantly related to the M-3 criterion. These variables 
taken together produced a multiple correlation of 52 (r = 46 after correc- 


hrinkage) against that criterion. Also, five test variables were sig- 


tion for s 
nificantly related to the P-2 criterion; these variables produced a multiple 


correlation of .46 (r = 39 after correction for shrinkage) against the P-2 
criterion. And, as а final example, eight variables were significantly related 
to the P-3 criterion. They produced a multiple correlation of 58 (r= 50 
after correction for shrinkage) with P-3. Since it is very likely that the pre- 
dictive utility of any combination of test measures will vary from setting to 
setting and from one type of supervisor to another (4), these examples are 
intended only as illustrations and certainly not as suggestions of variables that 


would be universally useful in making such predictions. 

‘The main thrust of the data presented supports the criticisms of the 
“patent medicine” approach to industrial selection and prediction and makes 
clearer the hazards involved in developing “clinical impressions” of potential 
success on the basis of standard tests often used for that purpose. 
the varying patterns of relationship of the test variables 
teria suggest strongly that as we move away from thinking 
terion of success and move further in the direction of 
ultifaceted conceptions of success, testing devices like 
praisal battery" may, indeed, provide useful infor- 
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diction for three of the criteria: morale concerning supervision, average number 
of suggestions submitted, and average number of awards received for sug- 
gestions. 

Despite moderate success in relating standard psychological tests to specific 
multiple criteria of supervisory performance, the data clearly demonstrated 


the danger involved in the usual approach to forming psychological appraisals 
from test results. 
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REVERSAL AND NONREVERSAL SHIFTS IN NEURO- 
LOGICALLY HANDICAPPED CHILDREN*? 


Department of Psychology, California State College, Los Angeles 


Ropert Н. REIHER, ]озЕРН С. PHELAN, AND VERNON L. KIKER, Jn. 


A. INTRODUCTION 


Research in conceptualization problems indicates that an ontogenetic pro- 
gression in concept achievement may exist through the comparison of kinder- 
garten children (7) and normal human adults (5). 

This study was conducted to determine the relationship between children 
with neurological impairment and adults with neurological insult on concep- 
tualization tasks involving reversal and nonreversal shifts as in Phelan and 
Gustafson (9). It also seemed important to determine the relationship that 
existed between both children and adults with neurological insult and normal 
adults, as in Kendler and D’Amato (5), and slow learning young children (7). 

Kendler (4, p. 44) defines a concept as “the acquisition or utilization of 

. a common response to dissimilar stimuli.” Several theoretical models have 
played an important role in influencing experimental designs. One such model 
is that of an S-R viewpoint. Both Pavlov (8) and Spence (10) have ex- 
pounded this approach with reference to conditional differentiation and dis- 
crimination learning. A single unit S-R theory assumes a direct association 
between physical stimuli and overt responses. Spence, utilizing this paradigm, 
limited his predictions to inarticulate organisms. Human beings, however, 
learn to make covert responses to physical stimuli. These responses produce 
stimuli that mediate between the external stimulus and the overt response. 

ng the first to use the mediational construct. According 

Hull (2) was ло i ion can initiate within the organism an event called 
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Kendler (4) suggested that experimental conceptual behavior can be described 
from an S-R. framework in two ways. One, which seems to fit the behavior of 
lower animals and young children, is to represent a common response to dis- 
similar stimuli in a single unit manner (S-R): i.e., with each overt stimulus 
being directly associated with an overt common response. For human adults, a 
mediation framework (S-r-s-R) is employed. This indicates that instances (5) 
of a conceptual grouping are transformed into an implicit representational 
response (r) that results in a primary cue (s) for an overt sorting response 
(R). The nature of this representational response (r) is that it typically 
represents an "abstraction" process on the part of the $. Therefore, within 
this theory, concept learning is the acquisition of a categorizing response to 
common mediators evoked by various otherwise unique stimulus patterns. Con- 
ceptual behavior is thus a special mediated chain involving multiple converging 
linkages between external stimuli and responses, through a discriminatory 
mediational process. 

An experimental technique currently employed in the investigation of con- 
cept formation within the mediation theoretical structure is that of the Re- 
versal Shift (RS). This paradigm is a meaningful representative of a transfer 
of training problem in a two choice situation in which a person’s previously 
correct responses are no longer correct and his previously incorrect responses 
now are correct. "Two basic types of shifts are used. For a reversal shift, the 
subject must learn to assign all formerly positive instances to the negative 
category and vice versa. The same stimulus dimension is relevant to both con- 
cepts, but the stimuli to category concepts are reversed. A nonreversal shift 
requires the subject to make these assignments on the basis of a different 
relevant dimension. Therefore, if attaining a concept is a process of associating 
physical stimulus attributed with overt category responses, it should follow 
that the reversal shift should be more difficult than the nonreversal shift. This 
follows from the fact that some proportion of the associations learned in the 
first concept is valid after the nonreversal shift. Reversal shifts, in contrast, 
require a change in all initially acquired associations. Thus, if the number of 
new associations to be formed (or old ones to be extinguished) is a measure 
of problem difficulty, reversal shifts should take longer to complete than non- 
reversal. 

Experimental studies using reversal and nonreversal shifts (5) with college 
students have demonstrated a more easily attainable reversal shift. Kendler 
and D’Amato reasoned that "a reversal shift occurred at a more rapid rate 
than a nonreversal shift because at the completion of the learning of the first 
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concept, the symbolic cues appropriate to the second concept were available 
to Ss in the reversal shift groups" (5). 

In a similar type of problem, rats find a nonreversal shift easier than a 
reversal shift (3). It is possible that a single unit S-R theory accurately repre- 
sents the behavior of rats, while mediational S-R theory is required to explain 
concept learning of articulate humans. 

In an attempt to locate a transitional point between single unit and media- 
tional theory, Kendler and Kendler (7) performed a study with kindergarten 


children, which showed that these children, as a group, executed a reversal 


and nonreversal shift at approximately the same rate. When these kindergarten 


children were divided into fast and slow learners on the basis of their per- 
formances on the first problem (training discriminations), slow learners per- 
formed during the second problem (test discrimination) according to the 


single unit theory ; like rats, they found a nonreversal shift easier. Fast learners, 
on the other hand, performed in accordance with the mediational theory; like 


college students, they found a reversal shift easier. These results were inter- 
demonstrating that these kindergarteners, taken as a group, were 
f developing mediating responses relevant to this task, and that 
some were further along than others. “The suggestion is strong, then, that as 
you ascend the evolutionary scale, organisms acquire a greater capacity to gen- 
at enable them to make rapid RSs” (6, p. 11). If the importance, 


the fact that humans utilize mediated language responses, a fast 
e subject is using a common mediated re- 


preted as 
in the process 0 


erate cues th 
then, lies in 
reversal shift may indicate that th 
sponse to classes of stimuli. A person responding to the stimuli in a single-unit 
fashion should characteristically show а slow RS. The fact that young chil- 
dren exhibit a slow RS can be used to indicate that they do not utilize a 


common response to stimuli. 
Goldstein and Gelb (1) studied intellectual impairment of soldiers suffer- 
frontal areas. There was in particular a distinct 
loss of ability to think conceptually. In sorting tasks involving multicolored 
yarns, these subjects showed a restricted concrete level of performance. In 
particular, they would not generalize widely or shift from one basis of classi- 
fication to another. Subsequent investigations have justified the conclusion 


that individuals with brain injuries, especially in the frontal areas, suffer a 

loss of abstracting ability. ! 
Heinz Werner (11, р. 240) states that the gradual growth of a unitary 

intellectual function involves specific changes in the process of abstraction 


itself; “One who can shift his point of view in a purposeful grouping activity 


ing traumatic lesions in the 
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is no longer subject to the forces of sensory stimulation. He is able consciously 
to perceive that objects have different qualities, any one of which may be taken 
as the point of departure for an ordering process. In other words, this devel- 
opment indicates an immensely important step away from an abstraction 
closely allied to sensory organization and toward an abstraction guided by 
deliberately selected categories, such as color, shape, number, size, etc." 
Werner points out pathological examples which he feels are concomitant 
with his developmental approach. Utilizing the work of Head, Goldstein, and 
Gelb with brain-injured patients, he points out the similarities in behavior 
in so-called abstraction experiments between the brain-injured adult and child. 
Phelan and Gustafson (9) have conducted research in reversal and non- 
reversal shifts in acute brain injured with injury diffusely localized patients 
at Sepulveda, San Fernando, and Brentwood V. A. installations. For the acute 
cerebrovascular accident patients, there seems to be an impairment in media- 
tional processes, In contrast to normals, a reversal shift did not occur at a 
more rapid rate than a nonreversal. It is in conjunction with these results that 
the current research with neurologically handicapped children from the Diag- 


nostic School for Neurologically Handicapped Children, Los Angeles, has been 
conducted, 


B. METHOD 
1. Subjects 


The Ss were 42 students consisting of 10 females and 32 males ranging in 
age from six to 15. All Ss have demonstrable neurological impairment of dis- 
parate origin. A total of 18 Ss were eliminated: nine could not solve the first 
concept; four could not perform the first stage, second concept; one subject 
was color blind; and the remaining four were eliminated for the purpose of 
equating groups. If these subjects were included in our final results, the level 
of significance of the differences comparing the relative effectiveness of reversal 
and nonreversal $$ would all be the same. $$ are resident patients at the 
Diagnostic School for Neurologically Handicapped Children, Los Angeles. 


2. Material 


The №. Y. О. Card Sorting Test, Form 2, was utilized. This test is 
patterned after the Wisconsin Card Sorting Test I, differing mainly in that 
it permits the testing of more abstract and complex concepts, In this experi- 
ment two sample cards were used: a large orange diamond and a small dark 
gray ellipse with pointed ends—similar in shape to a football. The form and 
color of the stimulus cards differed from those used in the response deck to 
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avoid having any response card be a duplicate of either stimulus card. The 
deck consisted of 32 response cards which varied in terms of four forms 
(circle, crescent, square, and a [-shaped figure), four “colors” (black, medium 
gray, yellow, and red), and two sizes. 'The designs appeared on three-inch 
square cards with the designs themselves being either 1 inch or 2 inches in 
height. 

The two major concepts used in this experiment were shape (S) and color 
(C). The shape concept required rectilinear shapes (square and [-figure) to be 
sorted below the large orange diamond, and curvilinear shapes (circles and 
the small gray ellipse. The color concept required 


crescents) to be sorted below 
orted below the small gray ellipse. 


achromatic figures (gray and black) to be s 


3. Design 

t was to compare the relative effective- 
ness of a reversal shift as compared with a nonreversal shift. This could be 
the 2 X 2 factorial design in which half of the Ss learned 
initially the shape concept and the other half learned the color concept, then 
half of the Ss in each of these groups learned a reverse shape concept and the 
other half of each group learned the reverse color concept. Thus four groups 
were formed: Shape-Reverse Shape (S-RS) 5 Shape-Reverse Color (S-RC) ; 
Color-Reverse Shape (C-RS) ; and Color-Reverse Color (C-RC). 

In such a design as this, Ss in the nonreversal groups (S-RC and C-RS) 
during the learning of the second concept would receive fortuitous partial 
of sorting responses associated with their first concept. For 
example, placing a yellow crescent below the large orange diamond stimulus 
card would be a correct response for S in Group C-RS during the learning of 
the first concept. It would also be a correct response for the same $ during 
the learning of the second concept. In fact, all rectilinear achromatic and cur- 
vilinear chromatic cards would provide partial reinforcement effects for the 
nonreversal Ss during their learning of the second concept. Those cards were 
therefore eliminated during the initial stage of learning the second concept. 
This resulted in Ss of both the reversal and nonreversal groups receiving 
100 percent nonreinforcement of their sorting responses which had been 
correct for the first concept. The elimination of these aforementioned cards 
had also other consequences. Only rectilinear chromatic and curvilinear achro- 
matic cards were left in the response deck during the first stage of the second 
concept. Since these cards resembled one of the two stimulus ie gen 
diamond, gray ellipse) both in terms of shape and color, they unis B ae 
priately sorted below the same stimulus cards for both the reverse shape an 


The main purpose of this experimen 


accomplished by 


reinforcement 
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TABLE I 


Schematic Representation of the Design 
of the Experiment 


First 
t 
Concept: Second Concep 
Shape or First Stage: Reverse Second Stage: Reverse 
Color Shape and Reverse Shape or Reverse Color 
Color 
roup S-RS 
Group 5 Group S-R (SC 
roup S-RC 
roup C-RS 
Group C Group C-R (SC 
Group C-RC 
Group O-RS 
Group O-R (SC) 
roup O-RC 


reverse color concept. 'That is, during this first stage, the sorting responses of 
the response cards in the abridged deck would be identical for both reverse 
shape and reverse color concepts, This point is represented in Table 1 by the 
abbreviation R(SC) which indicates that in the first stage of the second 
concept both reverse shape and reverse color sorting responses are correct for 
all Ss. 

This condition necessitated the reinsertion of the discarded cards during 
the second stage of learning the second concept. In order to compare the 
relative effectiveness of a reversal and nonreversal shift, it was necessary to 
have response cards which required different sorting responses for the reverse 


color and reverse shape concept. 


eh 


| 
| 
| 
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In order to discover whether the transfer effects resulting from a reversal 
or nonreversal shift were positive or negative, a control group (Group O) 
was added that did not receive any training on the first concept. The first ex- 
posure these Ss had to the experimental routine involved the first stage of 
learning the second concept (O-SC). The pattern of reinforcement to which 
the control group was subjected was identical to that received by the two 
major experimental groups. 

To summarize, the design, which is schematically represented in Table 1, 
was an extension of the 2 X 2 factorial design described at the introduction 
to this section. The extension was necessitated by the needs (a) to equate 
partial reinforcement effects for the reversal and nonreversal shifts and (b) to 
discover the nature (positive or negative) of the transfer effects produced by 
reversal and nonreversal shifts. The first was accomplished by using an 
abridged response card deck during the first stage of the learning of the 


second concept, which at one and the same time provided 100 percent nonre- 


inforcement of the sorting response initially learned by the reversal and non- 


reversal groups and which also did not provide any differential reinforcement 
of the sorting responses associated with the learning of the reverse shape or 
reverse color concept. The second need was accomplished by adding a control 


group whose training began with the first stage of the second concept. 


4, Procedure 


The mechanics of the experimental procedure involved a display panel for 
each stimulus card below which a small cross appeared. At the start of the 
experiment the following instructions were orally presented by E: 

You will be shown а series of cards one at à time. Some of these cards 
will belong with this card (E points to the first stimulus card) and others 
1 i E points to the second stimulus card). You must decide, 
ыр T in ; a that I shall show you, with which of these two cards 
[dune ' h y iius cards) it belongs. Indicate your choice by taking 
s ue Ls 3 ently tapping the mark which lies beneath the card of 
this pci = m gan is correct I shall say "Right," but if it is incor- 
your hall say “Wrong” Do you have any questions? 
n the various stages of the experiment was made 
dification in either pattern of scoring or total 
о be used. The Ss in all the groups were 
o reach a criterion of 15 successive correct responses in their learning 
d during the second stage of the learning of the second 
during the first stage of the second concept was 10 


The transition betwee 
without informing Ss of the mo! 
number of response cards that were t 


required t 
of the first concept ап 
concept. The criterion 
successive correct responses. 
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The Ss had a limit of five decks (160 cards) to reach the criterion of 
learning the first concept and for the second stage of learning the second 
concept. Ап $ who during the second stage of the second concept failed to 
solve the problem within the limit of five decks was arbitrarily assigned to a 
score of 160. When the 16-card deck was used during the first stage of the 
second concept, a maximum of six decks (96 cards) was permitted. The 
response deck during all stages of the experiment was presented in an order 
which was designed to preclude sequential learning. 


C. RESULTS 


The response measure used during all stages of the experiment was the 
number of trials, excluding the criterion trials to achieve the performance 
criterion, Table 2 reports the data associated with the learning of the first 
concept for normal adults, cerebrovascular accident Ss, and neurologically 
handicapped children. The data are presented both in terms of the performance 
of the total number of Ss in both Group С and S, as well as their respective 
subgroups. Table 2 includes a similar comparison of the normal adults, cerebro- 
vascular brain-injured adults, and neurologically handicapped children. Results 
were highly skewed and, therefore, the nonparametric Wilcoxon's Matched- 
Pairs Signed-Ranks Test was used to evaluate the differences.? For the neuro- 
logically handicapped children, similar to the brain-injured adults, the differ- 
ences between the total shape and total color groups were not significant. The 
differences between the two subgroups of neurologically handicapped children 
that were compared in the learning of the second concept (Group C-RC and 
C-RS and Groups S-RS and S-RC) were slight and insignificant (with the 
use of the authors’ equating procedure). The total median scores for the 
neurologically handicapped in both the shape and color concepts, much like 
the brain-injured adult, suggest that unlike the normal adults these concepts 
do not follow the patterns of initial dominant responses to the experimental 
situation. 

Table 3 presents the data for both areas of the first stage of the learning 
of the second concept. With the neurologically handicapped child, there were 
no significant differences between the reversal and nonreversal groups; how- 
ever, there was a much higher range in the neurologically handicapped child's 
score, which is similar to those findings with the brain-damaged adult. Unlike 
the normal adults, the neurologically handicapped child in the O-RS and 
O-RC groups showed inconsistencies indicating a lack of clear-cut transfer 


2 Because of the fact that the results were highly skewed, the Wilcoxon Matched- 
Pairs Signed-Ranks Test was used for all comparisons. $ 


/ 
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Table 2 


NUMBER OF TRIALS REQUIRED TO LEARN THE FIRST CONCEPT 


Part A: 55 = Normal Human Adults* 


Group Mean 50 Мап Range 
S-RS 2.6 3.5 o 

А o- 
S-RC 2.4 5.6 о © 
Total $ 2.5 4.7 o 
S-RS 11.3 28.1 3 s 
S-RC 8.6 17.3 3 rd 
Total C 9.9 23.3 3 
Part В: Ss = Adult, Cerebro-Vascular Accident Cases - Diffuse Damage** 
Group Mean 50 нап Range 
S-RS 15.58 14.19 15.5 8-25 
S-RC 17.16 8.47 14,0 6-32 
Total S 16.7 7.49 14.8 
C-RC 14.21 10.26 11.5 3-47 
C-RS 18.63 7.80 14,0 -43 
Total C 16.21 10.77 13.1 


EMO ML 
— SSS 


part С: Ss = Neurologically Handicapped Children 
ар — T rng 


Grou 
S-RS 38.5 48.70 0-128 
S-RC 3.5 39.94 0-106 
Total S 35.0 43.02 

C-RC 32.42 28.91 24.0 РЕЯ 
C-RS 28.0 50.30 14,0 0-140 
Total C 30.21 39.42 19.0 
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H. H. Kendler, senior author, and 
on for The Journal of Experimental 
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e first stage of learning the second concept. 
ffectiveness of reversal over nonreversal 
a of Kendler and D'Amato on normal 
n and Gustafson) and neurologically 


handicapped children. These data indicate the effectiveness of learning during 


d concept. It appears that for the neurologically 

th stage of the secon! 

дә -— eins in both the Groups S-RS and S-RC and Groups C-RS 
4 CRC the reversal shift was nof observed to occur at any significantly 

and C-RC, the 5 


effects from the first concept to th 

'Table 4 compares the relative € 
shifts. This table compares the dat 
adults, brain-damaged adults (Phela 
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Table 3 


NUMBER OF TRIALS TO REACH CRITERION OF LEARNING 
DURING THE FIRST STAGE OF THE SECOND CONCEPT 


Ss = Normal Adults 


Group Mean SD Mdn Range 
S-RS 6.0 4.4 4 2-16 

S-RC ал 4.0 3 1-18 

C-RC 5.4 6.2 3 1-26 

C-RS 4.9 3.5 4 1-16 

0-85 12.3 13.1 7 3-50 

O-RC 18.0 24.1 7 2-91 

Ss = Adults-Cerebro- Vascular Accident 

Group Mean SD Мап Range 
S-RS 31.25 10.57 28 19-62 

S-RC 35.47 17.02 26.5 20-73 

C-RC 34.22 21.18 26 14-89 

C-RS 35.23 15.52 31 18-63 

O-RS 30.41 18.22 26 13-83 

о-ас 28.90 16.39 24 10-76 

Ss = Neurologically Handicapped Children 

Group Mean SD Мап Range 
S-RS 35.62 27.39 25.5 2-80 

S-RC 33.13 27.15 44.0 2-74 

C-RC 23.0 18.65 30.0 3-50 

C-RS 21.14 20.98 21.0 1-57 

O-RS 31.83 21.21 29.5 2-62 

0-RC 37.67 34.19 27.0 5-84 


faster rate than the nonreversal shift. These results are consistent with those 
of Phelan and Gustafson in testing brain-injured adults. 

In comparing the two control groups of neurologically handicapped chil- 
dren with their matched experimental groups, no significant information is 
found as to whether reversal or nonreversal shifts resulted in positive or 
negative transfer effects. These results, again, are similar to those of the brain- 
injured adult. 


1 
| 
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Table 4 


NUMBER OF TRIALS TO REACH CRITERION OF LEARNING DURING 
THE SECOND STAGE OF THE SECOND CONCEPT 


55 = 

Group Mean Мап Капде_ E 
S-RS 6.7 25.6 о 0-106 

S-RC 50.4 57.7 20 0-160 (ou 
C-RC зл 8.5 0 0-33 

C-RS 60.3 52.3 40 0-160 < «OL 
0-А5 16.2 39.3 0 0-160 

0-RC 29.4 45.5 6 0-16 

Ss = Adults -Cerebro-vascular Accident 

Grou Mean sD Mdn Range 

S-RS 43.75 39.02 26 16-160 — not signif 
S-RC 52.89 46,37 37 13-160 mob 378010, 
с-вс 32.57 27.04 21 12-106 

C-RS 46.63 43.81 30.5 21-160 < not signif. 
0-RS 56.75 34.78 43.5 27-160 

0-вс 51.0 39.93 37 29-160 


ЕТ ЖЫ Neurologically Handicapped Children 


Grou Mean 50 Yn Range 
S .0 83 1-160 
5 ne e is 58.5 1-160 < not signif. 
— Rc a. 
C-RC ing oT 128 166 < not signif. 
61 22-160 
0-5 75.5 ren 7h 18-160 


O-RC 78.0 


between the control groups of O-RC and O-RS and 
d S-RS differ from those of the normal adult 
y that nonreversal shift produces a demon- 
differ from those of the findings of 
nt with those obtained from brain- 


The comparisons, then, 
the experimental groups S-RC an 
subject with reference to the theor 
strable transfer effect. These data, then, 
Kendler and D'Amato but are consiste: 


damaged adults. 
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D. DiscussioN 


Kendler and D'Amato (5) concluded from their work on concept formation 
in normal adults that human concept behavior could be explained by an S-R 
formulation involving a mediational process. The results obtained from work 
with normal adults demonstrate a superiority of the reversal shift and suggest 
a positive transfer effect. According to the S-R mediational theory, upon com- 
pletion of the learning of the first concept, the symbolic cues appropriate to 
the second concept would be present for the $$ in the reversal group but 
merely connected to the "wrong" sorting response. 

The neurologically handicapped child and quite possibly the brain-injured 
adult manifest consistently wide discrepancies between the results obtained 
from normal adults. The neurologically handicapped child seems to indicate 
an inconsistent functioning mediational process as defined in terms of the 
experimental operation. By comparison, many of the neurologically handi- 
capped children took a large number of trials to achieve the first and second 
concept. Problems with a fixed response, memory impairment, and confusion 
were evident. Unlike the normal adult subjects, there seemed to be no marked 
transfer effects in either direction. This gives rise to an assumption that the 
possibility of an ineffectively functioning mediational process exists in the neu- 
rologically handicapped child. 


E. SUMMARY 


In the neurologically handicapped child, like the brain-injured adults in 
the card-sorting task, a facilitation of card sorting appears to occur even if 
the appropriate symbolic cues are connected to the wrong sorting response. 
The neurologically handicapped child, much like the brain-injured adult, 
does not seem to show a significant difference in the ability to effect a reversal 
shift more rapidly than a nonreversal shift. At the completion of the learning 
of the first concept, the symbolic cues for the second group do not seem to be 
any more present for Ss in the reversal group than the nonreversal group, nor 
do they ever seem to be connected to the wrong response. 

The results which Kendler predicted in reference to positive transfer effects 
were not found. Kendler interpreted his results to mean that the essential 
learning which is basic to performance in these tasks entails a complex asso- 
ciation between external stimuli and overt category responses mediated by 
internal (and probably) verbal activity. Unlike simple conditioning, in which 
practice presumably increases the strength of S-RS connections, training in 
concept problems determines which verbal cues are dominant. If appropriate 


cues are strengthened during training, transfer will be facilitated. Results 


g~ 
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with the neurologically handicapped child and the brain-damaged adult indi- 
cate that there was no significant transfer effect in either direction. It appears, 


then, that there may be 


an impairment in the same mediational process which 


facilitates the reversal shift in the normal adult. 


11. 
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THE STUDY OF SELF-DISCLOSURE IN 
EXPERIMENTAL INTERVIEWS* ! 


Veterans Administration Hospital, Salem, Virginia 


Frep W. VONDRACEK 


A. INTRODUCTION 


A major aspect of interpersonal transactions is the continuous exchange of 
information. In special interpersonal transactions, such as counseling and 


psychotherapy, а large part of this information exchange concerns information 


about the self. 
Jourard (3) has labeled this type of information exchange self-disclosure ; 


he developed the Self-Disclosure Inventory (SDI) for the 
). The host of studies which followed 
the introduction of the SDI attests to the apparent significance of the field 
of self-disclosure. Only relatively recently, however, has the validity of the 
SDI been questioned. Significantly, none of the recent validity studies of 


the SDI were able to confirm its validity (1, 2, 7; 8). 
In addition to this apparent lack of validity of the SDI, which has been 
t in the field of self-disclosure for years, there 


has also been a conspicuous absence of experimental research in this area. It 
will, therefore, be interesting to develop new, preferably behavioral, measures 
of self-disclosure, and to attempt the application of appropriate experimental 


methods in the study of this important variable. 
B. METHOD 


1. Subjects 


and, subsequently, 
empirical study of self-disclosure (5 


the major research instrumen 


males studying introductory psychology were recruited 
would be interviewed. The interviewers were 
hology graduate students. Each interviewer 
a reflecting, and a revealing interviewing 


Sixty undergraduate 
for an experiment in which they 
five male and five female psyc 
interviewed two Ss each in a probing, 
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technique. The interviewing techniques were defined by the interviewer's 
verbalizations, and interviewers were trained in the use of these techniques 
prior to their first actual interview. 


2. Procedure 


All interviews were transcribed on magnetic recording tape. After the 
initial introductory statement by the interviewer, which was identical in all 
three techniques, the interviewers observed a three-minute period during 
which they held their verbalizations to an absolute minimum. The Ss’ verbal- 
izations during this period were used as base-rate data in the statistical anal- 
yses. After this three-minute period, depending on which technique the inter- 
viewer was using, a probing, a reflecting, or a revealing statement was made 
at specified intervals. If the Ss did not terminate the interview after the initial 
20 minutes, the interviewer was instructed to do so. 


3. Measurement of Self-Disclosure 


Two parameters of self-disclosure—amount and intimacy—were measured 
separately. The amount was determined by timing the S's verbalizations 
during the interview, carefully excluding periods of silence and interviewer 
statements. Intimacy of self-disclosure was judged independently by two 
judges, in each case from the first two pages of the typewritten interview 
transcripts. The ratings were made on a Likert-type scale, representing a 
theoretical continuum ranging from a score of 1 (least disclosing) to a score 
of 7 (most disclosing). Both amount and intimacy were measured separately 
for the base-rate periods and the experimental periods. Transcripts were 
disguised so that the judges were unable to determine which interviewing 
technique had been used in any given interview. At the conclusion of each 


interview, the Jourard Self-Disclosure Inventory was administered by an 
assistant. 


C. RESULTS 


Product-moment correlations were calculated to assess interjudge reliabil- 
ities. Correlations were computed separately for the judgments of intimacy of 
the base-rate periods, the treatment periods, and the two periods combined. 
The coefficients ranged from .71 to .76. 

A preliminary analysis of covariance of the amount of disclosure scores 
revealed that neither the individual interviewers, nor the interviewer's sex 
accounted for a substantial amount of the variance. It was found, however, 
that the interviewing techniques significantly affected the amount of self-dis- 
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closure produced by the Ss (Table 1). A Tukey Gap Test revealed that the 
probing condition was most effective in eliciting amounts of self-disclosure, 
while no significant difference existed between the reflecting and the revealing 
techniques (Table 2). 


TABLE 1 
ANALYSIS OF COVARIANCE OF AMOUNT OF DISCLOSURE SCORES 
Source of variance df Mean square F 
Techniques 2 158644.84 3.74% 
Еггог 56 42387.39 
Total df 58 
* p< .05. 
TABLE 2 
MEANS AND STANDARD DEVIATIONS OF AMOUNT OF DISCLOSURE ScorEs* 
Measure Probing Reflecting Revealing 
x 625.00 458.00 424.75 
ТБ 182.0+ 228.72 274.52 


t: Critical value — 147.57, ? — .05. 


Note: Tukey Gap Tes i 
t is seconds, 


* The unit of measuremen 

Analysis of covariance of the intimacy ratings indicated no differences in 
the intimacy of the disclosures made by any of the experimental groups. How- 
ever, the intimacy ratings of the treatment periods were much higher (more 
intimate) than the intimacy ratings of the base-rate periods (5 < .001). 

Finally, the correlations between the measures of self-disclosure used in 
the present study and the SDI ranged from .10 to .20, and were not significant. 


D. Discussion 


The experiment was designed to study the differential effectiveness of three 
interviewing techniques in eliciting self-disclosure. Furthermore, it was at- 
tempted to develop behavioral measures of self-disclosure. 

One interesting finding suggested that a probing interviewer elicits sig- 
nificantly greater amounts of self-disclosure than a reflecting or a revealing 
interviewer: ‘This apparently contradicts the findings of Jourard and Lands- 
man (4) and Jourard and Richman (6), who reported evidence suggesting 
a dyadic effect in the disclosing behavior of individuals, in 


the operation of in the sing ) 
that disclosure by one individual will in turn lead to reciprocal dis- 
closure by the other member of the dyad. | 

Tuckman (9) has suggested a hypothesis which may account for the findings 


of the present study. He suggested that a probing technique, being more 


such a way 
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direct and instrumental than a revealing technique (or a reflecting technique), 
should be more effective in eliciting self-disclosure; since the revealing tech- 
nique operates primarily as a stimulant to disclosure, and appears to be much 
more passive and indirect than the probing technique. It could be speculated, 
on the other hand, that these differences hold up only in initial or short-term 
contacts, and that in long-term interpersonal relationships the “dyadic effect” 
develops gradually. 

The failure to discover systematic differences in the judged intimacy of the 
interviews appears also to be related to the relative briefness of the encounter 
between the Ss and the interviewers. It appears that intimacy may increase 
as the duration of the encounter increases. The significant increase in rated 
intimacy between the base-rate periods and the experimental periods may 
partly reflect this increase in intimacy as a result of the increased duration 
of the relationship, although it probably also reflects the sudden increase in 
the interviewer's verbalizations at the conclusion of the base-rate period. 

More generally, the present study demonstrates the feasibility of experi- 
mental research in the area of self-disclosure. Both measures of self-disclosure 
used in this study were found to have satisfactory reliabilities (10). Inter- 
estingly, their correlations with the SDI, a self-report measure, were rather 
low. This may be due partly to the fact that the SDI confounds the two 
parameters of self-disclosure—amount and intimacy—swhich were measured 
separately in the present study. А 

The preceding conclusions should not be interpreted to suggest that the 
SDI should be abandoned as a research instrument. However, it would seem 
advisable to examine further the relationship between the SDI and the behav- 
ioral measures of self-disclosure introduced in this study. Furthermore, the 


experimental study of self-disclosure would appear to be the tactic of choice 
in studying this much neglected aspect of interpersonal transactions. 


E. SUMMARY 
. In order pede the differential effectiveness of three interviewing techniques 
in eliciting = "disclosure, 60 male Ss were interviewed either by a probing a 
reflecting, or a revealing interviewer, Self-disclosure was defined as consisting 


of two parameters—amount and intimacy—and measures were developed t? 


ааны e, Ане of covariance suggested that the probing condition was 
Loue ot E: high quantities (amounts) of self disclosure, an 
that none of the interviewing techniques produced differences in the rate 
intimacy of the Ss’ disclosures. The shes 

approach and valid and reliable me 


stressed. 


n a 
nce of developing an experiment: 
asures in the area of self-disclosure was 


N 
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AN ANALYSIS OF THE INSTRUCTIONAL EFFECTIVENESS OF 
VISUAL ILLUSTRATIONS PRESENTED VIA TELEVISION* 
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Francis M. Dwyer, Jr. 


A. INTRODUCTION 


The use of varied visual illustrations to complement oral instruction pre- 
sented via television is rapidly gaining wide acceptance at all levels of instruc- 
tion. However, there is very little experimental evidence available as to how 
learners react to the various types of visual illustrations and to the variations 
in the amount and kind of realistic detail which they contain. In some cases 
it may be that the types of illustrations used to complement oral instruction 
may actually be hindering the learning process rather than facilitating it. 
Consequently, it seems necessary that guidelines be developed which will 
enable educators to select those types of visual illustrations for their televised 
presentations which will be most effective in promoting student achievement 
of desired educational objectives. A study by Dwyer (2) indicated that when 
students interacted with visually complemented programed frames for as long 
as they felt necessary to comprehend the information presented, different types 
o£ visual illustrations facilitated student achievement with different degrees of 
effectiveness. 

The purposes of this study were (a) to measure the relative effectiveness 
of varied visual treatments possessing different amounts of realistic detail 
are viewed for equal amounts of time, and (5) to determine 


when the visuals k | 
lementing oral instruction presented 


which type of visual illustrations complemer 
via television were most effective in facilitating student achievement on five 


criterion measures. 
В. METHOD 


1. Subjects 


One hundred and sixty-two students in Speech 200 classes at The Pennsyl- 


vania State University were used as subjects. 


* i in the Editorial Office, Provincetown, Massachusetts, on February 7, 
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2. Treatment Group 


Students in Group (Grp) I, the control group, received no illustrations of 
the heart, but viewed slides containing the names of the facts and processes 
of the heart as they were mentioned orally. Grp II viewed simple line repre- 
sentations of the heart. Grp III viewed detailed, shaded drawings. Grp IV 
viewed photographs of a heart model, and Grp V viewed realistic heart photo- 
graphs. 


3. Procedure 


Students were placed in treatment groups by indicating which of the five 
testing sessions they would be able to attend; the experimental treatments 
were then assigned at random to the five groups. The audio presentation con- 
sisted of a 17-minute audio-taped presentation which described the heart, its 
parts, its internal functioning, and the simultaneous processes which occur 
during the systotic and diastolic phases. Each visual treatment contained 39 
black and white slides which were synchronized with the audio-taped pre- 
sentation, so that the slides presented visually relatively the same information 
that was being presented orally. Each instructional presentation was video- 
taped on an Ampex 660 B, and later presented to the respective treatment 
groups by means of the Telebeam Model A 912-A Television Projector which 
projected a 5 X 3-foot image. All groups received the same oral instruction, 
and viewed their respective type of visual illustrations for equal amounts of 
time. 

4. Criterion Measures 


Each student in each treatment group received a pretest, participated in 
his respective instructional presentation, and then received four individual 
criterion tests. Scores received on these tests were combined into a 78-item total 
criterion test. The objective of each test was as follows: (a) drawing test—to 
evaluate learning of specific locations of the various patterns, structures, and 
positions of the parts of the heart; (b) identification test—to measure transfer 
of learning: i.e., the ability to identify numbered parts on a diagram of the 
heart from information received in the instruction; (c) terminology test—to 
evaluate student knowledge of referents for specific symbols; (4) compre- 
hension test—to measure understanding of the heart, its parts, and internal 
operations; and (e) total criterion test—to measure the student's total under- 
standing of the concepts presented. The Kuder-Richardson Formula 21 relia- 
bility coefficients for the five criterion measures were: (a) drawing test .84; 
(5) identification test .81; (c) terminology test .81; (4) comprehension test 
.76; and (e) total criterion test .93. 


_- 
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C. RESULTS 


The Hartley test for homogeneity of variance (6, pp. 94-95) performed on 
the pretest scores achieved on the Quick Scoring Mental Ability Test indi- 
cated that the treatment groups were drawn randomly from populations with 
common variance. The F-ratio resulted from the analysis of variance indi- 
cated that significant differences existed among the treatment groups on the 
drawing test (F = 3.32, df = 4/161, р < .05). No significant differences 
existed among the means of the remaining four criterion tests. 

Comparisons among the individual means of the five treatment groups on 
the drawing test were conducted via Tukey’s Й Procedure (3, 4). The results 
indicated that the students viewing the abstract line presentation (Grp II) 
and the detailed drawing presentation (Grp 111) achieved significantly 
higher scores than did those students receiving the oral presentation without 
visuals (Grp II > Grp I, 7 = 2.80, п/о = 4/162, р < 05; Grp III > 
Grp I, W = 2.66, n/v = 4/162, p < .05). 


D. Discussion 


The results indicate that the use of visual illustrations to complement oral 
instruction is not an effective way to facilitate student achievement of all 
types of educational objectives. A number of possible explanations may be cited 
to explain the results obtained in this study: (а) since college students are 
generally selected from the upper two-thirds of the population in terms of 
verbal and conceptual ability, it seems that they are in a highly favorable 
position in terms of being able to learn from oral instruction; (2) the initial 
impact of the realistic detail in the visuals may have had the net effect of dis- 
ents from the essential learning cues in the oral and visual 
student achievement of the types of learning objectives 
measured by the identification, terminology, comprchension, and total criterion 
test are not facilitated by visual cues similar to the ones used in this study ; 
(d) since students in each treatment group viewed their respective televised 
ation for equal amounts of time, those students. who viewed the more 
realistic types of visuals may not have had sufficient Mme © study and com- 
prehend adequately the additional information contained in the visual illus- 
trations presented to them. The effectiveness of the line drawings seems to be 
in accordance with Attneave’s research (1) which contends that only essentials 
travel through the nervous system to the brain and that lines bordering objects 
provide the essence of the information to be conveyed. Travers and his col- 
leagues (5) have stated that visual information is stored in the nervous system 
in some form isomorphic with line drawings, and that this permits the individ- 


tracting the stud 
presentations; (c) 
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ual to organize, remember, and reproduce edited information more easily than 
relatively realistic information. 


E. SUMMARY 


A number of important generalizations emerge regarding the use of visual 
illustrations to complement oral instruction presented by means of television. 

1. The use of visuals to illustrate subject content is not an effective means 
of improving achievement by college students of all types of educational objec- 
tives when students view their respective visuals for equal amounts of time. 

2. 'The data indicate that the instructional effectiveness of visual illustra- 
tions is a function of the type of learning task they are designed to complement. 
"This would seem to imply that each television instructor will have to define 
his objectives behaviorally and then select the appropriate visuals that have 
been found to promote most effectively student learnings of those specific 
objectives. 
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A. INTRODUCTION 


While in the past few years an abundance of research has been done on the 
effects of stress on performance (8), comparatively little work has been done 
to determine what personality dimensions influence people’s susceptibility to 
these effects of stress. After Witkin’s (11) description of field dependence- 
independence was considered, it seemed that this might be one of those 
personality dimensions. The cognitive styles associated with field dependence- 
independence have been described as follows: “At one extreme (field depen- 
dence) there is a consistent tendency for experience to be global and diffuse; 
the organization of the field as a whole dictates the manner in which its parts 
are experienced. At the other extreme (field independence) there is a tendency 
for experience to be delineated and structured ; parts of a field are experienced 
as discrete and the field as a whole organized" (11, p. 319). 

It seems plausible then that field independent persons in a stressful situation 
than field dependent persons to separate and ignore the 
li and focus on task relevant cues. If this were 
hould perform better under stress than those 
ted here represents an attempt 


might be better able 
context of stressful stimu 
true, field independent persons shou! 
who are field dependent. The experiment repor 


to investigate this possibility. 


B. METHOD 


The general approach was that two groups of Ss, a field independent (FI), 
and a field dependent (FD) group, performed three tasks first inquiet: [Оў 
and later in a distraction condition (D). die two groups’ performance я 
the three tasks in D (after adjusting for Ss' performance in Q) was then 


evaluated. 
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Auditory distraction was chosen as a representative of many possible stressful 
situations (4). The three tasks were selected with the expectation that they 
would be sensitive to the effects of stress. They were Digit Span, Anagrams, 
and the Stroop Color-Word Interference Test. Digit Span (DS) was selected 
because it has been shown to be adversely affected by stress ( 10). "The task was 
made additionally difficult by requiring Ss to repeat the digits in the reverse 
order in which they had heard them. A 16-item Anagrams task (AGM) was 
chosen, since it is difficult and requires fairly constant attention. It was thought 
that these qualities would make the task vulnerable to the distracting effects 
of noise. The Stroop Color-Word Interference Test (CW) was used because 
it was thought that distraction would increase the interference involved and 
thereby have deleterious effect on Ss’ performance. In the CW task, four 
color names—red, green, blue, and orange—are repeated randomly on one 
sheet of paper for a total of 176 color names. Each color name was printed 
in one of the three colors other than the hue itself: e.g., the word “red” 
might be printed in blue, green, or orange ink. 8 was asked to name as rapidly 
as possible the color of ink in which each word was printed. 

Scores on the Embedded Figures Test, EFT (6), an anxiety questionnaire 
(2), need achievement scale (3), and the American College Testing Program 
Examination (ACT) were obtained from 99 males and 95 females. On the 
basis of EFT scores, a field independent group (Fl) and a field dependent 
group (FD) of Ss were formed. The two groups were then equated on four 
variables that are possibly related to task performance in stressful situations: 
anxiety level, need achievement, intelligence (as measured by the ACT), and 
grade-point average. The final groups, each consisting of 20 females, performed 
the three tasks—DS, AGM, and CW—initially in Q. Three weeks later, Ss 
again performed the three tasks, this time while wearing earphones? through 
which tape-recorded noises were played at a high but not painful intensity. Ss 
were presented with two different forms of the tasks on the two occasions, the 
order of which was counterbalanced across Ss. The order in which the three 
tasks were performed was also counterbalanced across Ss. 


Four scores were derived from the three tasks and constituted the dependent 
measures. The scores obtained were time and error scores for CW, number of 
correct digits recited backwards for DS, and the number of Anagrams cor- 
rectly solved for AGM. 


m 


2 Only one earphone was worn for the DS task in order to allow $ to hear Ё 
present the digits. 
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C. RESULTS 


The correlation between EFT scores and scores for anxiety, need achieve- 
ment, and ACT for the entire sample of 99 males and 95 females are given 
in Table 1, The only correlation which was significant was that between EFT 
and ACT. The correlation between these variables was significant and similar 


in magnitude for both sexes. 


TABLE 1 
EEN EFT Scores AND SCORES FOR ANXIETY, NEED ACHIEVEMENT, 


CORRELATIONS BETW 
AND ACT For MALES AND FEMALES 


Sex Anxiety EFT and n Ach ACT 
Males +14 +.09 --.29** 
Females —0* +.07 +.25* 

* p< .02. 

+ p c .01. 


There were no significant differences between the two forms of the three 
tasks for either of the Q or D conditions. The two forms of each task were 
therefore considered to be equivalent. 

The performance of Ss in FI was slightly superior to Ss in FD on three 
out of the four measures—viz., DS, AGM, and CW time—but was slightly 
poorer on CW Errors. An analysis of covariance (9) on the task measures 
in D, adjusting for performance differences in Q, revealed that none of these 


group differences approached statistical significance. 


D. Discussion 


That a correlation was obtained between EFT and intelligence as mea- 
sured by ACT is consistent with Witkin, Dyk, Faterson, Goodenough, and 
Karp’s (12) report that field independence correlates with intelligence. No 
correlation for either sex was found for need achievement or anxiety. The 
latter is in general accord with Bound (1) who reported that field dependence- 
independence was not related to measures of neuroticism, 

The failure to obtain significant differences in task performance between 
FI and FD indicates that field independence is not linearly related to per- 
formance in distraction specifically and possibly performance in stress gen- 
erally. This finding is in agreement with the work of Karp (7) and Holtz- 
man and Bitterman (5). Karp (7) reports that field independence is not 
related to the capacity to ignore distracting stimuli in performing a task. The 
distracting stimuli Karp studied, however, were in the same—i.e., visual— 


68 JOURNAL OF PSYCHOLOGY 


modality as that of the task relevant stimuli. The findings of this study indi- 
cate that Karp's conclusions may be extended in regard to the modality of 
the distracting stimuli. That is, field independence is not related in a linear 
fashion to the capacity to ignore distracting stimuli regardless of the modality 
in which they occur. 

Holtzman and Bitterman (5) in a factor analysis of a wide range of mea- 
sures relating to adjustment to stress included a vertical orientation task 
similar to one of Witkin's measures of field independence which correlates 
with EFT performance. The vertical orientation test did not load on the 
same factor as performance under stress. 

One methodological criticism should be noted here. The effects of distrac- 
tion and retesting are confounded in the D condition. A comparison group 
which performed the tasks in quiet the second, as well as the first, time would 
allow these effects to be separated. Since, however, no significant differences 
were obtained and the data are consonant with other findings, the lack of such 
а comparison group does not seem crucial. 


E. SUMMARY 


It was hypothesized that field independent persons would perform better 
in a stressful situation than field dependent people. A group of 20 field inde- 
pendent female 8s and a group of 20 field dependent female Ss performed three 
tasks: Digit Span, Anagrams, and the Stroop Color-Word Interference Test, 
in both quiet (Q) and auditory distraction (D) conditions. Both had been 
equated on anxiety level, need achievement, intelligence test scores, and grade- 
point averages. Ап analysis of covariance for the scores on the three tasks ob- 
tained in D, adjusting for initial performance in Q, did not reveal any sig- 
nificant differences between the two groups. The findings of this study were 
considered as consonant with the reports of other investigators. 
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A. INTRODUCTION 


Recent cross-cultural research points to significant differences in symptom- 
atology and dynamics of abnormal behavior among various ethnic groups. 
Italian and Irish descent have attracted the atten- 


In particular, patients of 
21). While germane to present cross- 


tion of many investigators (16, 18, 2 
cultural interest, this study shifts its focus from the pathological to the normal 


pole of the behavioral spectrum. In addition, it was designed to test some 
of the theoretical assumptions and research results on psychocultural dimen- 
sions of achievement motivation presented by McClelland (13, 14, 15), Atkin- 
son (3), Rosen (22), and Strodtbeck (27). Among the basic conclusions 
reached by these authors, the following are particularly pertinent to the 
present investigation: (a) certain factors exist in early family structure and 
subsequent cultural conditioning that lead to variations in achievement moti- 
(b) these variations seem to persist during the process of assimilation 
ironment ; (c) achievement motivation is positively rela- 
acculturation and to the rate of mobility of a specific 
s within the same culture; (d) in the American 
with the same sex parent. 


vation ; 
to a new cultural env 
ted to the degree of 
ethnic group relative to other 


milieu high achievers tend to identify c ( 
The specific purpose of this study was, then, to investigate the effect of 


ethnic background on achievement motivation, parental identification, and 
religious ideology in male college students of Italian and Irish extraction, 
who were classified into High Academic Achievers (HAA) and Low Aca- 
demie Achievers (LAA). It was hypothesized that, since high achievement 

he degree of acculturation, the difference 
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sured should be less than that between the two Low Academic Achievers 
groups. Hence, the two HAA groups should reflect their original value sys- 
tem to a lesser extent than the LAAs. The rationale for selecting members of 
these two ethnic groups was that they present significant differences with 
respect to the specific psychocultural dimensions of achievement motivation 
as stated above. Each group differs in relation to the time of immigration into 
the United States and consequently to the length of acculturation to the Amer- 
ican environment (1). There is some evidence that the mobility rate of the 
Italians is at present higher than that of the Irish (28). The original family 
structure and value system of the two groups, particularly the religious ideol- 
ogy, present contrasting patterns and characteristics. 


B. METHOD 
1. The Sample 


From a total of 493 freshmen and sophomores in an all-male, Eastern Jesuit 
Liberal Arts College, 60 pairs of subjects were matched on the basis of intel- 
ligence, but differed in academic achievement. Each group of 60 High (HAA) 
and 60 Low (LAA) Achievers consisted of 30 students of Italian descent 
(Southern Italy) and 30 students of Irish descent, all comparable in socio- 
economic background, All subjects were Catholics, of approximately the same 
age (18-20), and enrolled in the College of Arts and Sciences. 

Students who reported at least one paternal and one maternal grandparent 
born in Southern Italy were considered Italians. Students with one or more 
paternal or maternal grandparent born in Ireland were considered Irish. Se- 
lection was made from freshmen and sophomore classes which were exposed 
to the same basic courses of instruction. 

Upon entrance into the College, each student was given a college ability 
test (SCAT). SCAT scores for the 493 students ranged from 274 to 336, 
with a mean of 312 (SD = 9; median = 312.4). Selection of subjects was 
made from SCAT scores between 312 and 330. Differentiation of HAA and 
LAA subjects was made on the basis of actual performance through the first 
semester of college, since recent studies have indicated that the first semester Gen- 
eral Point Average (GPA) is one of the most reliable predictors of academic 
performance in the four years of college (8). Sophomores whose academic 
achievement in the second and third trimester differed significantly from that 
of the first semester were eliminated from the sample. An index of achieve- 
ment was established by comparing the percentile rank of each student's SCAT 
score with his percentile rank in academic grades during the first semester of 


ANGELO DANESINO AND WILLIAM A. LAYMAN 73 


college. HAAs showed academic achievement within the upper third of the 
class, and were performing by at least 4-10 above expectancy as measured 
by the index. LAAs showed achievement in the lower third of the class, and 
were performing at least —10 below expectancy, as measured by the index 
On the basis of ¢ tests that proved insignificant the HAA and LAA ornis 
were adequately matched for intelligence, but were significantly different 
in academic achievement. Tests for homogeneity of variance among the groups 


yielded no significant results. 


2, Measuring Instruments and Testing Procedure 
The test battery was administered in & single afternoon and in the follow- 


ing sequence: 
a. McClelland Achievement Motivation Test (14). Four pictures were 


used with standard instructions (14) for each subject tested under neutral 
conditions. They were: 1) a man working in a chemistry laboratory; 2) а 
boy with a vague operation scene in the background (TAT card 8 BM); 
3) a тап skiing down а steep hill; and 4) an original picture of a student їп 


three-quarter profile, sitting at à desk reading a letter. 

'The scoring system used in this study was а slightly modified form of 
McClelland's System G (155 which yields a total need achievement score. 
Procedures and a detailed description of each scoring category are given. by 
McClelland e£ al. (1% 15). A correlation of .89 was found for 45 randomly’ 
the authors and another qualified psychologist. 3 


selected protocols scored by 
b TAT [nvolvement Score. Degree of parental involvement was measured 


by stories written in response to two 'TAT cards (6 BM—older woman and 
sans 7 BM—older man and younger man). The two pictures were pro- 
fected immediately after the McClelland test with standard TAT instruc- 
Hans ai with @ five-minute time limit for writing cach story. Each story 

ix-point scale devised by Bieri, 


was scored for parental involvement on а St 
Lobeck, and Galinsky (5). Each subject was scored F (father) or M 


(mother) depending upon which story produced the highest involvement 
core of zero was given if the two scores were equal. Quantitative 
ssigning а value of +1 to the father and —1 to the 
dependently scored by the authors and another quali- 


as 9] percent scoring agreement. 
ix indices pertaining to the scoring categories of 


the McClelland Achievement Motivation Test were chosen for particular 
s study. They were (a) G+ and G— (positive and negative 
d Bp (environmental and internal obstacles) ; and 


score. A 5 
Scores were derived by a 
mother. Of 50 stories in 
fied psychologist, there w 

c. Ranking of Themes. 5 


analysis in thi 
affective states) ; (4) Bw an 
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(c) SR and LR (short range goals and long range goals). The rationale for 
the selection of these variables was based upon the assumption that they repre- 
sent contrasting personality patterns of Italian (G+; Bw; SR) and Irish 
(G—; Bp; LR) subjects (11). 

Eight different themes selected from actual protocols of the McClelland 
Motivation Test and expressing all the combinations of the three pairs of 
variables taken one at a time were ranked by the subjects. The same four slides 
of the McClelland Motivation Test, as well as the same testing procedures, 
were used. Three minutes were allowed for the ranking of the eight themes 
for each slide. To obtain numerical scores, each subject arranged the themes 
on each slide from one to eight. Thus, by totalling the ranks of the four 
themes, concerning each of the six variables on each slide, a score was obtained 
for each subject for each variable on each slide. Each slide was scored sepa- 
rately. The total of the ranks for each subject was 36. For example, if the 
subject’s score for variable G was less than the median of 18, he was rated 
a G+, if it was greater, he was rated a G—. This process was repeated for 
the B and R indices for each slide. Subjects who scored 18 for a given variable 
were eliminated. 

d. Fichter Religious Scale. The Fichter Religious Scale (9), in a slightly 
modified form, was used to measure the extent to which the groups being 
tested agreed with the official position of the Catholic Church on certain moral, 
social, and religious questions. Five items taken from Schuyler's Religious 
Questionnaire (23) were added to Fichter’s Scale for a total of 20 items; 
10 of these were related to dogmatic and authoritative matters which require 
strict Catholic adherence, and 10 were nonauthoritative items which could be 
subjectively interpreted, Subjects answered each item with an agree (А) or 
disagree (D) response within a total time limit of four minutes. It was 
emphasized that the items merely represented a wide range of opinions, Each 
response indicating adherence to Church ideology was given a score of one; 
each nonadherence response was scored zero. The median for all HAAs on 
the authoritative scale was first determined. АП Italian HAAs whose scores 
were below the median were classified as nonadherence ; all whose scores were 
above the median were classified. as adherent. The process was repeated for 
the Irish HAAs. Similarly, the median of the LAAs was determined on the 
authoritative scale and the Italian and Irish were classified as either adherent 
or nonadherent. Frequencies on the nonauthoritative scale were established 
in the same way. 

е. Osgood's Semantic Differential. "The Osgood Semantic Differential (20) 
was used with standard instructions and a four-minute time limit to measure 
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the perceived similarity aspect of identification. A list of nine pairs of bipolar 
adjectives, to be rated on a seven-point scale, was employed. The subject rates 
himself, his father, and his mother on each of the nine dimensions. А high 
nt indicates a perception of greater dissimilarity between 
w score reflects a greater similarity. А per- 
ceived similarity with the father is scored by an F, with the mother by an M. 
By subtracting the M. score from the F score, a difference score (D) is ob- 
tained. If the D is positive, the subject perceives himself as more like his 
he sees himself as more like his father. If D is zero, the 
qually similar to both parents (5). A quanti- 
a value of +1 to the father and —1 to 


score for either pare 
the subject and the parent; a lo 


mother ; if negative, 
subject perceives himself to bee 
tative score was derived by assigning 


the mother. 
f. Similarity Perception Questionnaire. The Similarity Perception Ques- 


e was used as another measure of parental identification. А special 


booklet was distributed to the subjects who were asked to respond to each 
question in as detailed a fashion as possible within a five-minute time limit. 
The content of the Questionnaire was suggested by the interview schedule 
used by the Funkenstein Biographical Questionnaire (10). In agreement with 


the procedure of Bieri et al. (5) the measure of similarity perception was 
based upon the answer given by each subject to the following question: 
Which parent do you feel you are most like at the present? If one parent was 
clearly indicated, a score of either F (father) or M (mother) was given; if 
both parents were mentioned, but one was given greater weight, a score of 
either F/M or M/F was given, and these were considered F and M responses 
respectively ; if neither parent was relevant to the question, а score of zero was 
assigned. Quantitative scores were derived by assigning a value of +-1 to the 
father and —1 to the mother. | 

The measure of parental involvement Login upon the answer given by 
each subject to the following question: TF hich — did you feel closer to 
as a child? (4, 5). ‘The question was intersperse with the other items of the 
Similarity Perception Questionnaire. The same scoring procedure outlined 


above was applied. 


tionnairi 


C. ResuLTs AND DISCUSSION 


Le McClelland 


f the tests between 


Achievement Motivation Test 
the two ethnic groups of High Achievers 


Low Achievers (LAA) are presented in Table 1. Italians 
higher scores (p < 01) than did Irish subjects in the HAA 
he LAAs, Irish subjects had significantly higher 


Results 0 
(HAA) and of 
had statistically 
pairings. However, among t 
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TABLE 1 
Test or SIGNIFICANCE BETWEEN ITALIAN AND IrisH HicH (HAA) алко Low (LAA) 
ACADEMIC ACHIEVERS FOR MCCLELLAND MOTIVATION TEST SCORES 


Mean 
McClelland 
score SD t 
2 9.30 3.71 2.85% 
Irish HAA 6.90 2.66 
Italian LAA +70 2.30 2.26* 
Irish LAA 6.17 2.77 
* p« 05. di 
+<. 


scores (№ < .05). A test of the difference between the differences of the Italian 
and Irish HAA and of the Italian and Irish LAA subjects yielded insignificant 
results. Thus, Italians tended to score at the extremes in motivation, Reports 
by McClelland (13), Rosen (22), and Strodtbeck (27) on the effect of ethnic 
and sociocultural factors on achievement motivation offer a possible interpre- 
tation for these findings, Italians, because of their more divergent ethnic back- 
ground from the American culture and a greater language barrier, seem to have 
an initial disadvantage with respect to the Irish in the process of assimilation. 
Thus, Italians may more readily either accept or reject the challenge of their 
present sociocultural environment, The Italian HAAs, in order to overcome 
their greater initial handicap, must be more highly motivated than the Irish 
HAAs. Italian LAAs may not be able to free themselves so completely from 
previous sociocultural influences, which may expect modes of achievement 
different from those of the American environment. This different reaction of 
the Italian HAAs and the Italian LAAs could explain, at least partially, the 
differences in their achievement motivation level with respect to the two 
Irish groups. 

Achievement motivation may also be related to the rate of status mobility 
in the two ethnic groups (6, 7). Although it has been found that Irish have 
a greater mobility than Italians (30), there are indications that the present 
Italian population has a relatively greater rate of acceleration with regard 
to status mobility than the Irish (28). Thus, the higher motivation of the 
HAA Italians could reflect this greater mobility rate, while the LAA Italians 
might be continuing at the status mobility level characteristic of early Italian 
immigrant groups. Concerning the often raised question as to whether mobility 
is a function of need achievement or vice versa (6, 7, 12), the analysis of the 
data point to a complex, multiple interaction including not only these two 
variables, but other psychocultural factors as well. 


- 


ANGELO DANESINO AND WILLIAM A. LAYMAN 77 

The intervals between the motivation mean scores of the LAA and the 
HAA groups were similar. Furthermore, the mean scores for all four groups 
were relatively low when compared to the maximum possible score, Further 
research should explore reasons for (4) the relatively large difference between 
the Italian and Irish HAA subjects, and (b) the general low motivation 
scores of this sample of Catholic college students which contrast with recent 
findings that indicate an increase in achievement motivation of Catholic 
as compared with non-Catholic groups (29). 


2. Ranking of Themes 


For each of the three sets of dichotomous variables, chi-square tests were 
used to evaluate the performance of the groups on each of the four slides 
(Table 2). For the G variable, the Italian HAAs had significantly greater 
G+ scores than did the Irish HAAs on slides 1, 3, and 4. Similarly, Italian 
LAAs produced greater G- scores as compared with Irish LAAs on slides 1, 
2, and 3. There were no significant differences between the High groups on 
slide 2, nor between the Low groups on slide 4. For the B variable, there were 
t differences between Italian and Irish HAA subjects on any 
an LAAs showed a significantly greater concern than 
ternal obstacles related to achievement goals on slides 
ble yielded no significant differences between Italian 

slide. However, Italian LAAs demonstrated a 
ge (SR) quality in achievement goals, as compared 


no significan 
slide. However, Itali 
did Irish LAAs for ex 
1, 3, and 4, The R varia 
and Irish HAAs on any 
greater degree of short ran 
with the Irish LAAs, on slides 1 and 3. 
TABLE 2 

Сні-500 IFFERENCES BETWEEN ITALIAN AND IRISH HIGH 

ue coe VER: EEN ITALIAN AND IrisH Low ACADEMIC 
ACADEMIC ACE ЕС Тона m dons Dinos m 

destin quo RANKING or THEMES TEST 


Scene 
Ski . 

Variable Chemist » Student kier | A Operation 

a d .86 10.40** ]1.52*** 
HAA nx 14.16t** 30.75*** 1.01 
LAA 36. 

B 1.04 03 E 
‘ene 39 1.67 13.65%®% 12.54*** 

А ] 

R 42 .88 2.93 
Has spose 1.19 18:75988 O Do 
LAA i E 

nina r eE ——— 

«e p< 01. 
ave р c 001. 
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The difference between the differences of Italian and Irish HAAs and of 
Italian and Irish LAAs on each variable for each slide was subjected to tests 
of significance (Table 3). For all three variables on slides 1 and 3, the differ- 
ences between LAAs were significantly greater than the differences between 


TABLE 3 
RESULTS OF ¢ TESTS OF THE DIFFERENCE BETWEEN DIFFERENCES FOR ITALIAN AND IRISH 
HIGH ACADEMIC ACHIEVERS (HAA) AND ITALIAN AND IRISH Low ACADEMIC 
ACHIEVERS (LAA) on G, B, AND R VARIABLES FOR EACH SCENE 
USED IN THE RANKING OF THEMES TEST 


Differences between means 


: Italian and Italian and 
Variable Scene Irish HAA Irish LAA 1 
G 1 2.96 6.43 3:407 
2 2.03 3.00 +65 
3 2.70 5.33 2.50* 
4 3.96 ‚73 2.23* 
B 1 .33 5.93 549% 
2 .67 .77 94 
3 10 2.53 2.27% 
4 24 3.80 2.59** 
R 1 66 3.73 4.2308 
2 24 97 59 
3 47 3.10 2.68** 
+ 87 27 .09 
W Ж.О, 
же Sc 1, 
"= е Ий. 


HAAs. Slide 2 yielded по significant effects between those pairings. For 
slide 4, the difference between the HAA groups was significantly greater for 
the G— variable, the difference between the LAA groups was greater for the 
B— variable, and there were no significant effects found for the R— variable. 

"These results may be due to (a) difference of the unique modal personality 
patterns in each ethnic group, and (b) higher degree of acculturation in the 
HAA group than in the LAA group. А possible explanation of the predomi- 
nance of G+ for both Italian groups is that this index reflects the commonly 
observed ethnic characteristics of extreme emotionality and life enjoyment, as 
well as optimism and realism with respect to Italians (11). These traits are 
in opposition to the commonly observed Irish characteristics of emotional in- 
hibition, pessimism, idealism, and fantasy life (24). The extreme poles (G++ 
for Italians and G— for Irish) for the affective state continuum appear to be 
the most distinguishing features between the ethnic groups. The above differ- 
ences persist in the assimilation process and were even found in the psycho- 
pathology of patients of Italian and Irish descent (19, 26, 31). On the other 
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hand, no significant difference between the НАА groups was found with re- 
spect to the B and R characteristics. It would seem that, with reference to 
external and internal barriers and to short and long range goals, the Italian 
and Irish HAAs have been assimilated to a point where there may no longer 
be any distinguishable difference relative to these traits. With LAA subjects, 
however, ethnic differences were found between the two groups with respect 
to all three variables. Thus, it seems that distinctive ethnic characteristics are 
neated between Italian and Irish LAAs. On the other hand, 
the HAAs seem to be more acculturated to the American scene than the 
LAAs. This suggestion is based on the fact that differences between LAAs 
differences between HAAs with respect to all three vari- 


more clearly deli 


were greater than 
ables. 
3. Fichter Religious Scale 

calculated for the high and low achievers with 
regard to adherence to the ideology of the Catholic Church with respect to 
dogmatic doctrine (authoritative scale) and nondogmatic doctrine (nonauthor- 
). There were no significant differences between the tested groups 
alues ranged from 07 to 6.08). Further analyses by tests of 
the difference between the differences of the Italian and Irish HAAs and of 
the Italian and Irish LAAs also yielded negligible results. To account for 
these insignificant findings it would seem that differences in religious attitude 
related to ethnic backgrounds may have been overridden by the uniformity 
of the subjects’ present religious education. These results are particularly 
h as the original attitudes of the two groups were remark- 


Chi-square tests were 


itative scale 
(chi-square V 


interesting inasmuc 
ably different (11). 
4, Parental Identification 
Two 1 identification were considered : (а) Е А 
perception as me the Osgood Test and the Similarity Perception 
Questionnaire ; and (2) involvement as measured by the two TAT cards and 
the Involvement Questionnaire. Subjects who expressed no similarity for 
either parent were eliminated from the oa kae (Table 4) indicate 
that Italian HAAs identified more closely with the father, while the Irish 
ded to identify more closely with the mother, as measured by the 
HAA ten md add ; larity scores, For LAAs the same 
vend found for the semantic differential 


semantic diff ere 
7 r-Irish mother re | i r 
зү oo er, the Similarity Perception Questionnaire, possibly because of 
score. Howeve?s y 
ificity, WAS 
its brevity and specificity» w 
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TABLE 4 
RESULTS OF CHI-SQUARE TESTS OF THE DIFFERENCES BETWEEN ITALIAN AND IRISH 
HicH ACADEMIC ACHIEVERs (HAA) AND BETWEEN ITALIAN AND IRISH Low 
ACADEMIC ACHIEVERS (LAA) ох Four PARENTAL SIMILARITY 
PERCEPTION AND INVOLVEMENT SCORES 


Perceived Semantic Involvement Е Involvement 
Groups Similarity Differential Questionnaire Score 
HAA 24.3608 34.27%% 36 24 
LAA .64 9.61** .01 58 
* >< 0], | Е 
ser 20.001, 


ment Questionnaire and the involvement score yielded no statistically reliable 
differences. Tests of the difference between the differences between Italian 
and Irish HAAs and Italian and Irish LAAs indicate that HAAs differed sig- 
nificantly more than LAAs on both the perceived similarity and semantic differ- 
ential scores (Table 5). It would seem that the significant findings of the 
differences between both the HAA and LAA Italian and lrish groups reflect 
ethnic differences regardless of achievement motivation. However, since the 
difference between the Italian and Irish HAA groups is significantly greater 
than the differences between the Italian and Irish LAA groups, the present 
sample suggests that the more closely the individual identifies with the 
parent who plays a dominant role in his ethnic background, the greater is 
his achievement motivation (17). "These results are of particular import in 
view of recent studies dealing with the relationship between achievement 
motivation and child-parent interaction and identification. In general HAAs 
tended to identify with the Same-sex parent and seemed to have rather authori- 
tarian mothers and less domineering fathers than did LAAs (2, 13, 25). It 
is interesting to observe that Italian HAAs in this sample identify with a 
paternal model which traditionally personifies authoritarianism, bossism, and 
TABLE 5 
RESULTS OF THE ¢ TESTS OF THE DIFFERENCE BETWEEN DIFFERENCES FoR ITALIAN AND 
Ікіѕн HIGH ACADEMIC ACHIEVERS (HAA) AND ITALIAN AND Irish Low 


ACADEMIC ACHIEVERS (LAA) on PARENTAL SIMILARITY 
PERCEPTION AND INVOLVEMENT Scores 


Italian and Italian and | 
Variable Irish НАА Irish LAA t 

Perceived Similarity 1.33 28 3,38** 
Semantic Differential 1.57 .85 2.50* 
Involvement Questionnaire 20 13 .28 
Involvement Score .07 .03 46. 
СТРАН 
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an unchallenged source of power. As a possible explanation for this finding 
one might suggest that the Italian HAAs, because of their more advanced 
acculturation into the American value system, are more tolerant of their 
original ethnic background and less susceptible to its negative influence than 
are Italian LAAs. Another interpretation may be that in the HAA group the 
father plays a less authoritarian and dominant role than in the original model. 
Consequently, the Italian HAAs might identify more easily with him along 
traditional patterns. In the case of the Italian LAAs, who are less assimilated 
than the Italian HAAs, paternal rigidity and authoritarianism seem to prevent 
the emergence of a strong achievement need in their sons. By contrast, the 
identification of the Irish HAAs with mother, which runs contrary to the 


-sex parent model, could reflect a specific ethnic dimension, in that her 
nant family role as compared with the father could 


ons’ achievement motivation, but could also impinge 


upon their identification process. In the LAA groups a rather passive adherence 
to original ethnic values appears to influence negatively the performance of 
both Italian and Irish subjects. In the HAA groups, on the other hand, per- 
sistence of ethnic factors may positively stimulate the group members toward 
self-realization. Thus, in the pluralistic society of the United 

with the original ethnic culture become conflict-free 
lation, they may provide a greater impetus toward 


same 
traditionally more domi 
not only influence the s 


achievement and 
States, when group ties 
in the process of assimi 
success and self-affirmation. 

D. SUMMARY 

effect of ethnic background on achievement 
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morvan 0 of Irish descent. Each ethnic group 
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dos НАА P048 cuis obtained from the test battery generally supported 
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EXPLORATIONS IN METHODOLOGY: THE SCIENTIFIC 
QUESTION AS А SCHEMAPIRIC DIMENSION* 


The Ohio State University 


SvEN LUNDSTEDT 


A. INTRODUCTION 


In a recent article (3, p. 894), S. S. Stevens reminds us eloquently of the 
"curious antagonism which has sometimes infected the relations between mea- 
surement and statistics." ‘The “schemapiric principle" joins together the 
. informal, analytic, schematic features of model-building” with the 
"s. . concrete, empirical, experiential observations by which we test the 
usefulness of a particular representation.” Notwithstanding arguments against 
Stevens’ philosophy of measurement, one can find much comfort in his useful 
dictum of scientific responsibility that Schematics and empirics are both 
essential to science, and full understanding demene that we know which is 
which” (3, р. 856). Consequently, this essay is not а quarrel with Stevens 
essentia "sis, Its aim is to question only the extent of his argument and to 
xo dn itical schematic dimension is equally as important 
veg HE ed ee This dimension concerns the root of inquiry 
initia. in scl s * 
itself, dh scientific question. 
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C. Questions AND HYPOTHESES 


The usual canons of scientific method traditionally distinguish between the 
alternative and the null hypothesis, a convention which does not invalidate 
the author's argument. Initially, the chief motive for any inquiry is a question, 
not a formal hypothesis which js simply an edited version of one’s question. An 
hypothesis always originates as a question, and remains like a question even 
in its declarative form because of its tentativeness and its appeal for proof or dis- 
proof. The relationship between the two has probably always been confusing. 
Their differences are formal and not substantive. In any event, research practices 
are often “quick and dirty” in this connection. Owing to such confusion we 
often lose sight of the primary value and utility of the research question, which 
is to be found in its quality and relevance to some important problem, Ac- 
cordingly, there are only formal grounds for hairsplitting about the apparent 
differences between questions and hypotheses. Ultimately, we may not require 
the null hypothesis as an extra step in the research process. It may become 
necessary only to state one’s research questions in their proper form, and 
then to follow this initial step with an appropriate empirical and logical test 


of the answers to them, which may simply be a quantified form of yes, 
no, or maybe. 


D. OTHER CRITERIA ror CHOOSING QUESTIONS 


The criteria for developing a question are both substantive and formal. A 
concern with substance js always reflected in the question: Is it important 
enough to worry about? A question is trivial when it is superficial and does 
not add something tangible to the solution of a problem. It should cut to the 
heart of an issue. But substance has always been notably affected by form, 
and form by substance. If the form of a question is poor, its substance may not 
become apparent. The formal qualities of a question are determined by how 
well it fits accepted standards of logic and language. 

At least six forms of research question can be distinguished, the most funda- 
mental of which are the simple affirmative and negative. They receive this 
rank because they contain the least amount of information of any of the six. 
“Is X true?” and “Is X not true?” illustrate this elementary form. We 
recognize in them the familiar statement of identity. 

A third and a fourth form are the conjunctive and disjunctive, so called 
because they connect two or more affirmatives with the word "and" or 
separate them with the word "or." For example, "Is it true that individual 
A has quality X and is it true that individual A has quality Y ?" are illustra- 
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tions. Replacing and with or makes it a disjunctive form. Accordingly, asso- 
ciation and exclusion constitute the new information within both. 

A fifth and a sixth form are the conditional and biconditional questions, so 
named because they affirm that one prior condition is dependent upon another 
subsequent one, or that the two are interdependent with one another. We 
guage framework of the cause and effect relationship 
between any two things. Contained in them are not only identity and associ- 
ation, but dependency and interdependency as well. Both are clearly an im- 
provement. A conditional is of the form, “Is it true that if some individual 
possesses quality X then will it result in that individual possessing quality weet 
An answer yes to this question specifies a relation of dependency, in one 
direction only. If the biconditional form is used, then the question, “Ts it true 
that if individual A possesses quality X if and only if it is true that the same 
individual possesses quality Y?" expresses а relation specifying interdependency. 
A familiar, informal way of expressing such relations is to inquire if X causes 
Y and Y causes X (in this latter case) or to inquire if X only causes Y in the 


former. | DEM 
A common mistake is to confuse the form and intent of a conditional or bi- 
d intent of a conjunctive one; for ex- 


iti i i rm an 

conditional question with the fo ntent for ex 

ample, to assume a cause and effect relationship where only an association is 

evident As a rule, we may only infer from an association between two events 
^ , 


that a cause and effect relationship exists between them; it is a matter of 

circumstantial evidence only. As a useful research strategy, however, we can 

often dnt good educated guesses about cause and effect based on answers to 
8 


а conjunctive question. . 
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E. ScHEMAPIRICS AND QUESTIONS 
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and to this extent are like self-fulfilling prophecies, then we are ahead of the 
game by knowing how we plan to begin. Our ability to predict and control 
is enhanced. 

In Table 1, strategy 4.5 in the upper right-hand cell would traditionally 
be the best one. Not only does it employ a question form that takes account 
of the mutual influence of the independent and dependent variable, but it 
uses the ordinal scale of measurement said to have the most information. If, 


TABLE 1* 
ExaMPLES OF RESEARCH STRATEGIES 

Type of Type of scale 

question Nominal Ordinal Interval Ratio 
Biconditional 1.5 2,5 5.5 +5 
Conditional 14 24 34 +4 
Conjunctive 1.3 2.3 3.3 +3 
Disjunctive 1.2 2.2 3:2 4.2 
Affirmative or 

negative 11 21 34 +1 


а The index number 1.1 above indicates a scale and question combination having 
the least information, while the index number 4.5 indicates that combination with the 
most information, 
as Stevens says (3, p. 850), “The permissible transformations defining a scale 
type are those that keep intact the empirical information depicted by the scale," 
then the present author would add that similar criteria can also be used to iden- 
tify a good question. Likewise, it is evident that strategy 1.1 is a weaker form 
containing less information. 

Between these two extremes lies the popular arena of debate as to those 
answers to our questions which constitute greater or less empirical truth. As 
a rule, the physical and biological sciences traditionally come out much better 
in this debate than do the social sciences, although often the comparison has 
been wrongly an invidious one favoring, unjustly, the so-called “hard” sciences. 
In its own way the intellectual challenge for the social sciences is apparently 
no less demanding and the original distinction has become questionable. 

But, physics and chemistry can indeed lay claim to many research strategies 
which are genuinely more rigorous. Their greater control of the research 
environment undoubtedly accounts to some extent for their eloquent progress. 
Like psychology, biology often comes off less well, by comparison, by virtue of 
its greater dependency upon its subject matter. | 

Even worse off are those social sciences which investigate human social 
behavior. They cannot expect to lay claim to much more than the bottom and 
top left hand portions of the table which contains strategies 1.1 to 2.5. There 


E 
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are those who would argue that unless the social sciences begin to approach 
the level of phenomenological control typical of these other sciences, they 
cannot hope to use effectively all strategies. 

Whether an interval scale, for example, can be used effectively to express 
continuous data from human populations is a question yet to be resolved. 
Maybe the term "interval" can only be an approximate concept at best. 
Thurstone and Chave (4) ran head-on into this problem when they tried 
(unsuccessfully) to find interval measures of people's attitudes. Their problem 
lay in controlling the variations in the judgments required of those who formed 
the panel of experts who were to select the intervals in their attitude scales. 
Corrections later applied to Thurstone’s original data, reported by Hovland 
and Sherif (1), demonstrated that such interval scaling was indeed subject 
to major personal biases, and that, such scales were only rough approxima- 
tions of a truc interval scale. Efforts continue to be made to improve this 
kind of situation, and some recent measure of success is evident. But on the 
whole, those who work on problems in the social sciences are restricted at 
present with the ordinal scale as their main tool. Knowing this one has to 
make the best of it, one facet of which can be to improve the quality and power 


of one’s initial research questions. 

The ethical problems raised by 
d by the conditiona 
al sciences. I 


experimental manipulation and control, and 
1 and biconditional form of question, are 
f logic and scientific method form the 
e to increase our knowledge of mankind’s social and 
fields we may always be severely limited in the ex- 
tent to which we can pursue such knowledge freely. Even now it is no longer 
possible to pursue knowledge for its own sake when it is evident that the human 
price is too high. Increasingly, this facet of social and psychological research is 
becoming a recognized boundary beyond which certain research methods are 
forbidden. The social and behavioral sciences do indeed have rather unique 
constraints placed upon them by the social and ethical contexts in which they 


are expected to perform. 


often require 
real obstacles in the soci 
only path we wish to tak 
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F. CoMPLEXITIES AND FuRTHER PROBLEMS 
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always have to deal with the issue of the ultimate functional value of an ex- 
tensive methodological schema too. Method for its own sake is hardly worth- 
while. Somewhere our sense of proportion must enter into the picture. We have 
von Neumann's warning to lean upon, which Stevens cites too: “After much 
‘abstract’ inbreeding, a mathematical subject is in danger of degeneration. At 
the inception the style is usually classical; when it shows signs of becoming 
baroque, then the danger signal is up” (5, p. 196). This might be said of 
research method. 


С. PLACE or STATISTICS 


Nothing has been said thus far about the other side of the "schemapiric" 
issue: namely, the choice of an appropriate statistic to go along with the form 
of a research question and scale. This is not a trivial choice. Among the diffi- 
culties arising from wrong choices of this kind are those of claiming too much 
from one's data and those of claiming too little. 

Measures of association or correlation are clearly appropriate for the con- 
junctive form. In special cases where prior methodology is experimental and 
so stipulated beforehand, such measures may even be used with the condi- 
tional and the biconditional. But, when no visible prior evidence of interde- 
pendency exists between two variables, then the associative form of statistic 
has an entirely different meaning empirically—a subtle and important differ- 
ence. Space does not allow further analysis of the fit between questions, scales, 
and statistics here; but one which this author plans to continue. Table 2 sum- 
marizes the main thrust of this argument by identifying some categories of 
statistical methods appropriate in the different strategies. 

The terms location, dispersion, and association are those used by Stevens 
and they involve such statistical measures, for example, as the mode, median, 
and arithmetic mean (location), percentiles, and standard or average deviation 
(dispersion), and rank correlation and product-moment correlation (associ- 
ation, dependency, and interdependency in special cases). The term differ- 
entiation applies to any technique which seeks to analyze the apparent 
difference between two variables, such as a correlation coefficient viewed 
negatively. Obviously, there is interpretive flexibility possible in the different 
measures. 

Strategies 1.5, 1.4, 1.3, 1.2, and 1.1 are evidently limited by the constraints 
imposed by the nominal scale. The range of the simple affirmative or negative 
question appears to govern part of the choice of statistical measures in strate- 
gies 1.1, 1.2, 1.3, and 1.4. It is only when we move into the higher level 
strategies that we begin to experience more options. For example, in strategies 
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à TABLE 28 
EXAMPLES OF RESEARCH STRATEGIES AND POSSIBLE STATISTICAL MEASURES 


Type of scale 


Type of 
question Nominal Ordinal Interval Ratio 
Biconditional Location Location Location Location 
Dispersion , Dispersion Dispersion Dispersion 
Differentiation Differentiation Differentiation Differentiation 
(1.5) Association Association ‘Association 
Dependency Dependency Dependency 
Interdependency Interdependency Interdependency 
(2.5) (3.5) (4.5) 
Conditional Location Location Location Location 
Dispersion Dispersion Dispersion Dispersion 
Differentiation Differentiation Differentiation Differentiation 
(1.4) Association Association Association 
Dependency Dependency Dependency 
24 (3.4) (4.4) 
Conjunctive Location Location Location Location 
Dispersion Dispersion Dispersion Dispersion 
Differentiation Differentiation Differentiation Differentiation 
(1.3) Association Association Association 
(2.3) (3.3) (4.3) 
Disjunctive Location Location Location Location 
Dispersion Dispersion Dispersion Dispersion 
Differentiation Differentiation Differentiation Differentiation 
(1.2) (2.2) (32) (42) 
Affirmative Location Location Location Location 
or negative Dispersion Dispersion Dispersion Dispersion 
(1.1) (2.1) (3.1) (41) 


а The index numbers—e.£» 4.5—above are those used in Table 1 and have the same 


meaning. 
3.4, 3.5, 4.4, and 4.5 ideally several additional statistical techniques may be- 
come allowable because of the assumptions underlying the questions and scales. 
In the final analysis all choices become governed, in turn, by the quantity and 
quality of one's initial data, which also may, or may not, rule out use of a 


given strategy. 
H. SuMMARY 


This paper began with the proposition that another critical dimension above 
and beyond those of schematics and empirics in their traditional sense is neces- 
sary initially in scientific endeavor. This dimension, the root of inquiry itself, 
is the scientific question. Several considerations making the valid structure of 
a good question were discussed, including the distinction between questions 
and hypotheses. Problems surrounding choice of one’s research question were 
said to be substantive and formal. The substantive aspect of а question is its 
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ultimate value and relevance. The formal aspect is to be found in its linguistic 
and logical structure. 

Six formal questions were discussed, each containing successively greater 
explanatory power and called the simple affirmative, negative, conjunctive, dis- 
junctive, conditional, and biconditional. This was followed by an analysis of 
some of the relations between questions, schematics, and empirics, as the latter 
two are defined and discussed by S. S. Stevens. The point is made that taken 
together, the form of a question and a scale can be used as a criterion for one's 
strategy in research. A wedding of "schemapirics" and questions should result 
in a better scientific tool. Finally, the relations of questions, scales and statistics 
to one another are discussed briefly indicating some of the broad initial choices 
that are possible within each of 20 strategies for research. There is need for 


further clarification of the relationship between questions, empirics, and other 
schemata as essential tools for research. 
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A COMPARISON OF TECHNIQUES FOR THE SCORING OF 
AVOIDABLE ERRORS ON THE RAVEN 
PROGRESSIVE MATRICES* 1 


Central Islip State Hospital 


Joux Е. Fnaccura, DIANE FIORENTINO, CHARLES SHEPPARD, 
AND SIDNEY MERLIS 


A. INTRODUCTION 


The Raven Progressive Matrices (RPM) was designed to measure the 
ability to form comparisons and reason by analogy, thereby providing an index 
of intellectual functioning (6). Both clinical practice and psychopathological 
dynamics suggest a relationship between type or degree of mental illness and 
disturbances in abstract processes, such as concept formation, comparative 
reasoning, and analogic reasoning. The RPM, while not primarily designed 
to assess the ideational effects of mental illness upon complex reasoning abil- 
ity and intelligence, has been utilized in that capacity. Research findings (2, 
3, 8, 9) appear contradictory with respect to RPM efficacy in distinguishing 
pathological influences. 

Sheppard (8, 9), using an avoidable error measure based upon item diffi- 
culties, found significant differences in the number of RPM reasoning errors 
made by narcotic addicts having various MMPI profile patterns. Kasper used 
deviations from expected set scores as an index and reported a failure “to 
demonstrate the efficiency of PM in discriminating the influence of pathology” 
(3, p. 24). Jurjevich (2) found no strong relationship between the incidence 

£ neurotic or characterological aberration when 


of avoidable errors and degree o 
or scoring procedure was employed. 
dy is to investigate the extent to which defining and 
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to detect differences in the abstract reasoning performance of MMPI diag- 

defining avoidable errors in terms of 
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the ideational effects of different types of mental illness. 
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B. METHOD 


Three techniques for scoring avoidable errors were applied to RPM answer 
sheets of 88 narcotic users. The first method was derived from Raven's (6) 
‘concept of expected set score based upon the total test score. Raven (6) pro- 
vides a table with the total number of items a testee should correctly solve 
in each set for a given score on the test. Additionally, items in each set are 
organized in increasing order of difficulty. If the testee, given his total score 
on the test, missed easier items in a set while solving more difficult items or 
missed more items than were expected for each set; these misses were scored 
as avoidable errors. Frequency of avoidable errors was summed across the 
five sets to establish a total Raven avoidable error score (R-AES). 

The second method was called judges’ rating of difficulty levels (JRDL)- 
This technique defined an avoidable error as a failure to solve an item whose 
difficulty level was within the testee’s range of ability as measured by his per- 
formance. Trained judges, using a table of empirically determined item diffi- 
culty levels (7), examined each RPM answer sheet, estimated the lowest 
item difficulty level the testee consistently passed, and scored the easier items 
missed as avoidable errors. The frequency of avoidable errors was summed to 
define the JRDL-AES. 

'The third method represented an attempt to attain increased objectivity by 
eliminating the need for judges and to overcome the apparent lack of con- 
sistent ordering of increasing item difficulty on the RPM. A frequency dis- 
tribution of item difficulty levels, with a class interval of .10, was constructed 
from data reported in an earlier study by Sheppard е? al. (7). Within each 
band of difficulty levels, the number of items needed to pass 70 percent of the 
items was determined. A cutoff item-difficulty value, the lower limit of the 
item-difficulty interval containing the most difficult item meeting the 70 per- 
cent criterion, was found for each testee. Items of easier item difficulty, 
higher than the cutoff value were scored as avoidable errors. "These were 
summed to define the objective rating of difficulty levels (ORDL-AES)- 
Product-moment correlation coefficients and percent agreement were used to 
measure the degree of association and congruence between the three techniques. 

The 88 subjects studied represented subsamples of narcotic users established 
on the bases of MMPI profile patterns. Rules for profile interpretation defined 
by Gilberstadt and Duker (1), Marks and Seeman (4), and Meehl and 
Dahlstrom (5) were followed in constructing the MMPI profile groups. Data 
in Table 1 summarize the characteristics of this sample with regard to age, 
education, duration of heroin use, RPM centile, and MMPI scale scores. 
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Analysis of variance indicated nine significant F ratios on the MMPI scales at 
the .05 level or less. Subsequent £ tests defined the reliability of profile assign- 
ment. Four profile patterns were established on the basis of MMPI T score 
elevations exceeding two standard deviations on the depression (2), psycho- 
pathic deviate (4), psychasthenia (7), schizophrenia (8), and hypomania (9) 


scales. 


TABLE 1 
Usinc Herois, RPM and IPAT CzNTiLEs, AND MMPI 


Mean AGE, EDUCATION, YEARS 
VaLUEs FOR Four MMPI PnoriLE TYPES 


T Scores, STANDARD DEVIATION, AND F 


42" 428' 987' 49° 
N=20 N=19 = 15: М=36 Р 

Variable М SD M SD М SD М SD rato —$ 
Age 29 669 29 558 30 572 31 721 35 м 
Education 10 2.60 9 2.03 9 2.10 10 242 .26 NS 
Heroin use 9 4.62 š 3.80 11 4.67 10 4.98 .90 NS 
Raven centile 48 29.60 50 25.83 41 30.00 58 25.89 146 NS 
IPAT 38 2845 29 26.05 32 29.04 36 28.34 .38 NS 
L 48 6.47 51 7.90 48 9.13 49 7.81 .72 NS 
F 5з 3181 54 2335 46 2743 SE 1336 168 — NS 

K 53 7.95 51 8.32 44 8.90 51 8.41 3.67 05 
Hs 35 548 55 682 52 1286 52 805 111 NS 
73 9.69 62 10.72 58 9.14 14.86 .001 


D 71 672 
Hy По we 57 33 2 7M S2 209 ы, 
31 802 70 1001 76 797 463 .01 


Pd 76 7.32 

Mf 58 7.56 59 9.99 63 8.24 56 10.28 2.11 NS 
Pa 31 816 67 952 65 60 52 514 — 2548 .001 
Pt i9. war € aos 72 13256 SE — 689 1640 с 
Se. 35 488 71 780 73 2017 55 493 2147 ‘001 
Ma 58 8.19 64 6.30 79 10.20 7 5.95 27.23 .001 
Si 32 677 56 672 56 $22 48 5461 914 1001 


nd 987’ profile patterns were defined as secondary 
athic, paranoid, and mixed schizophrenic personality 
ale scores for these profile 


The 42', 49? 428', a 
sociopathic, primary sociop: 
types. Data seen in Table 1 regarding mean sc 


groups were consistent with clinical expectancy. 
Tests of significant 7 are reported for groups having common overlapping 


scales, The 42’ group scored significantly higher on the depression scale than 
the 49’ group who scored significantly higher on the hypomania scale (? = 
5.48 and 6.70 respectively). The 428’ group scored significantly higher on the 
schizophrenia scale than the 42’ group, thus maintaining separation between 
these profile patterns on this dimension (¢ = 7.52). The 987’ group differed 
significantly from the 428’ group in elevation on the psychasthenia (7 = 3.00) 
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and hypomania (¢ = 5.11) scales. The 987’ group differed from 49’ group 
patients in elevation on the psychasthenia (* = 6.43) and schizophrenia 
(2 = 4.91) scales. 

With the definition of the groups established, analysis of variance was 
applied in order to test the significance of differences between the methods of 
defining avoidable errors with regard to their sensitivity in detecting func- 
tional differences in the reasoning performance of the four MMPI groups. 


C. RESULTS 


Measures of the degree of similarity between the three scoring methods, 
based upon the correlation coefficients and percent of RPM's scored between 
a single avoidable error of one another by each technique, indicated that ORDL 
and JRDL were most alike. Product-moment coefficients for ORDL-JRDL 
AES were .75, while ORDL-R and JRDL-R AES correlated .52 and .72 
respectively, Although the correlations for ORDL-JRDL and JRDL-R scor- 
ing were almost identical, percent agreement measures revealed that 73 per- 
cent of the answer sheets were scored within one avoidable error of another 
by the ORDL and JRDL methods, while the corresponding values for JRDL- 
R and ORDL-R were 48 percent and 45 percent. Tests of significance for 
differences between Proportions showed greater agreement between JRDL- 
ORDL than between either difficulty level method and R-AES at the .01 
level. These findings Suggest that the two difficulty level methods define and 
generally score avoidable errors in the same way. 


TABLE 2 
\voABLE ERROR MEANS, AND STANDARD DEVIATIONS FOR THREE 
SCORING METHODS AND Four PROFILE PATTERNS 


Profile Patterns 


2042 .. 987' | 428 2039 __ 
Scoring Х = 20 N = 15 №19 N = 34 
method M SD M SD M SD M SD 
R 495 218 4.55 1.67 5.9 249 4.74 224 
JRDL 3.30 — 2.53 3.80 2.83 5.68 3.235 3.4 1.82 
ORDL 295 31 3.73 2.69 5.95 400 $32 198 


Table 2 presents a summary of the avoidable error means and standard 
deviations for profile groups according to the scoring method used. Analysis 
of variance indicated that both profile groups (F = 4.10, df = 3/84, p < .01) 
and scoring method (F = 10.21, df = 2/6, р < :01) were significant. 

Tests of significance 7 indicated R-AES scored the profile groups differently 
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from the other methods. Significantly more avoidable errors were assigned 
to the primary and secondary sociopathic groups by this method than by 
either difficulty level method (42:R > JRDL, ¢ = 2.16, р < .05; К > 
ORDL, г = 229, p < 05; 49R > JRDL, г = 268, p < 02; R > 
ORDL, ғ = 3.15, p < .01). Differences in avoidable error means td 
profile patterns only showed significant variation when scored by JRDL or 


ORDL methods. Both found the paranoids made more avoidable errors than 
sociopaths (JRDL 28 > 42, t = 245, р < .02; 428 > 49, ғ = 3.14, 
р < 01; ORDL:428 > 42, t = 2.55, p < .02; 428 > 49, 1 = 3.38, 
p < .01). These data support the hypothesis that the manner of defining and 
the method of scoring avoidable errors influenced the number of avoidable 
ject groups and thus appear to be factors in the ability 


errors assigned to sub 
ational effects of different kinds of mental 


of this index to distinguish the ide 
illness upon intellectual reasoning. 


D. Discussion 
and scoring RPM avoidable errors in 


terms of empirically determined item-difficulty levels yielded a measure of com- 
ment capable of discriminating between MMPI 
profile-pattern-defined patient groups. Two scoring methods defining an avoid- 
able reasoning error as ча failure to solve an item whose difficulty level was 
within the testees range of ability, as determined by his performance" (8, 
p. 1043) were shown to be sensitive to differences in pathological type. Both 
scoring methods indicated significantly more avoidable errors were made by 
Ss with paranoid profile patterns than by Ss with either primary or secondary 


atterns. А scoring method using expected set scores as the basic 
oidable errors, applied to the same groups, did not show 


differences in the number of avoidable errors found in 


The data presented suggest defining 


plex-reasoning-ability impair! 


sociopathic p 
defining concept for ау 
statistically significant 


the groups. "A 
When mental illness affects ir al ability in such a way as to produce 


stency and accuracy of performance, appropriately defined 
{ avoidable error indices appear to be capable of 
tude of impairment in different patient 
groups. The appropriate definition, it seems, treats any item missed whose 
item-difficulty value is greater than an item-difficulty value representing the 
most difficult items an individual is capable of consistently solving as an avoid- 
able error. The various definitions and scoring procedures for avoidable errors 
employed by different investigators may be one of the reasons results of pre- 


ntellectu 


irregularities in consi 
ectively scored RPM 


and obj 
e magni 


estimating ог assessing th 
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vious research in this area appear contradictory (2, 3, 8, 9). The finding that 
conclusions based upon the same set of data would differ if the data were 
scored according to somewhat different avoidable error rationales provides 
support for this belief. 


Е. SUMMARY 


The Raven Progressive Matrices of 88 narcotic users, divided into four 
MMPI profile pattern groups, were scored for avoidable errors with the use 
of three scoring methods. Two of the methods defining avoidable errors in 
terms of empirically determined item-difficulty levels indicated significantly 
more reasoning errors were made by patients with paranoid profiles than 
by patients with either primary or secondary sociopathic patterns. The third 
method, based upon expected set scores, did not score significantly different 
numbers of avoidable errors in the four groups. The data were interpreted 
as indicating that the manner of defining and that of scoring RPM avoidable 
errors are important factors in determining sensitivity to reasoning errors ге- 
lated to pathological ideation. It was suggested that where mental illness 
affects consistency of performance, appropriately defined and objectively scored 


avoidable error measures can be used to assess the impairment of intellectual 
functioning. 
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LAYMEN'S IDENTIFICATION OF ITEM CONTENT ON 
THE EYSENCK PERSONALITY INVENTORY** 


Department of Psychology, University of New England, Australia 


GonpoN STANLEY 


A. INTRODUCTION 


Recent work on the Eysenck Personality Inventory (EPI) by Gorman (2) 
and Salas (3) has indicated that the scale scores of the EPI are susceptible to 
successful faking by psychologically unsophisticated Ss. Wiggins (4) has sug- 
gested that such faking may be possible because the Ss know which behaviors 
are appropriate or inappropriate for the person's reference group. Previous 
workers have demonstrated that the EPI scales are susceptible to motivational 
sets; the present study is an attempt to determine how successful psycholog- 
ically unsophisticated Ss would be at identifying the psychological significance 


of the EPI items. 
B. METHOD 


1. Subjects 


e 31 adults of both sexes attending their first introductory 
adult education center in northern New South 
a rural setting and most of the students 
ions. Only one of the class had been 


The Ss wer 
lecture in psychology at an 
Wales, Australia. The town is in 
were employed in nonprofessional occupat 
to university and none had formal training in psychology. 


2. Procedure 


The Ss were tested in а group setting and asked to complete the EPI ac- 
cording to standard instructions. The Ss were given code numbers and told 
that their individual responses would be treated confidentially. After all Ss 
had completed the inventory, the tests were collected by E and a second copy 
given to each S with the instruction to classify items according to content into 
three categories corresponding to the three scales of the EPI. A very brief 
definition of the extraversion (E), neuroticism (N), and lie (L) scale cate- 
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gories was given on the basis of the description given by Eysenck and 
Eysenck (1). 
C. RrsuLTS 

The mean correct identification of E scale items was 16.48 (o = 4.00), 
the mean № scale identification was 15.42 (о = 4.74), and that for the L 
scale was 4.61 (о = 2.57). These identifications represent 68.66, 64.25, and 
51.22 percent respectively of the scale items. 

Product-moment correlations between the $s own performance on the 
scales under standard instructions and the number of items correctly identified 
on each scale were computed for each scale. The respective correlations for 
the E, N, and L scales were .115, .059, and .023, suggesting no systematic 
relationship between the Ss’ number of responses on the scale and their number 
of correctly identified items. ў 


D. Summary AND Discussion 


Thirty-one people attending an introductory psychology lecture at an 
adult education center were asked to complete an EPI under standard in- 
structions and under instructions to classify the items according to content 
into the three scale categories. Over half the items in each scale were correctly 
identified. There was no significant correlation between the number of items 
identified and the number of items endorsed for each scale. However it 
should be noted that in the present case the standard completion of the in- 
ventory did not occur in a highly ego-involving situation. The fact that "lay 
psychologists" with no professional training are able to identify successfully 
the psychological significance of over half the items on the inventory suggests 
that the instrument has little value for serious classification purposes under 
conditions where Ss may be motivated to distort their responses. 
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THE FACTORIAL DETERMINATION OF TYPES OF 
CONFORMING INDIVIDUALS* 


Department of Psychology, Brigham Young University 


SAUNDRA Е. ZIMMERMAN, DARHL M. Peversen, AND Kay Н. SMITH 


— 
A. INTRODUCTION 

made to determine factor analytically types of 
ach to isolating types of conforming individuals 
is the obverse factor analysis method in which individuals are correlated over 
a set of measures. The present research included measures of personality char- 
ormity. Conformity was measured by both paper and pencil 
essure situation similar to Crutchfield's (4). 
performed to determine factors which repre- 
d with either paper-pencil measures or a 


Few attempts have been 
conformity (1, 5). One appro: 


acteristics and conf 
tests and a behavioral group рг 
An obverse factor analysis was 
sented types of individuals associate 
behavioral measure of conformity. 
В. METHOD 


f 38 attributes were obtained from 119 students at Brigham 
The following measures were used to assess conformity: 
dence subscales of the Survey of Interpersonal Values 
nt Scale (OAS) developed by Couch and Keniston 
acquiescence response set ; and group pressure yield- 
the amount of conformity in the group 
onality measures were administered : 


-Welsh Art Scale (2), Personal Ori- 


Measures 0 
Young University. 
conformity and indepen 
(6) ; the Overall-Agreeme 
(3) which is related to the 
ing (GPY) score determined by 
ion. The following pers 


Rokeach Dogmatism Scale (8), Barron 
entation Inventory (10) with the subscales of self-actualizing values, existen- 


tiality, feeling reactivity, spontaneity, self-regard, self-acceptance, synergy, 
nature of man as constructive, acceptance of aggression, capacity for intimate 
inner versus other directedness, and time incompetence versus time 
Survey of Interpersonal Values (SIV) with the subscales of 
recognition, independence, benevolence, and leadership; 
1РАТ Eight Parallel Anxiety Battery (9); Personal 
hich obtained age, SCX, year in college, frequency of 
hers and sisters, number of older brothers 


pressure situat 


contact, 
competence ; 
support, conformity, 
Forms B and F of the 
Data Questionnaire w 
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and sisters, father's age, amount of father's education, father's occupation, 
mother's age, amount of mother's education, and parents’ income. Grade point 
averages and scores on the American College Test were also obtained. 

A 119 X 119 matrix of correlations between Ss was obtained from the 
raw score matrix. Factors were extracted with the use of the principal axis 
method and rotated with the use of the normal varimax criterion (7). A mul- 
tiple regression analysis was completed for each factor with factor loadings 
larger than .30 utilized as the dependent variable in order to ascertain which 
of the 38 variables accounted for the variance within each factor. Those inde- 
pendent variables which accounted for at least one percent of the total vari- 
ance were selected and used to interpret the factors. 


C. RzsuLTS 


Twenty-nine factors were generated to account for the total variance and 
then rotated. The first 16 factors which accounted for 79 percent of the vari- 
ance were included in the multiple regression analysis. Factors I through XVI 
accounted for 11.5 percent, 8.7 percent, 8.0 percent, 8.1 percent, 6.2 percent, 
4.9 percent, 4.6 percent, 4.2 percent, 3.9 percent, 3.5 percent, 3.2 percent, 
3.0 percent, 2.8 percent, 2.6 percent, 2.4 percent, and 2.3 percent of the vari- 
ance, respectively. Only 11 factors were related to either GPY score, SIV con- 
formity, OAS, or SIV independence. Table 1 gives the significant correlations 
of the personality variables with the factor loadings for each of the 11 factors. 

The group pressure yielding (GPY) score or the behavioral definition of 
conformity was selected by the multiple regression process as predictive of fac- 
tor loadings for three factors—V, VIII, and X. Factor V on which 14 Ss 
received factor loadings beyond --.30 was a “socioeconomic factor,” since the 
higher correlations were with parents’ income, father's occupation, father's 
education, and mother's education, Individuals who belonged to this group 
tended to be nonyielders in the group pressure situation and to come from 
families of high socioeconomic status. Factor VIII (seven Ss received factor 
loading scores beyond 25.30) appeared to represent "ego-defensive" confor- 
mity. Individuals included in this factor demonstrated high GPY scores; yet 
indicated that they needed independence. They also denied having a need for 
recognition, had a lack of sensitivity to their own needs and feelings, and had 
a lack of spontaneity. Factor X (eight Ss were included here) appeared to 
represent the “dogmatic, authoritarian” conformist, Conformers of this group 
tended to be dogmatic, rigid, and close-minded. They had relatively little 
tolerance or understanding for others and tended to be primarily aware of 
their own inner needs. They were low in leadership, basically inner-directed, 
and had no particular motivation to change their perspective. 
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SIGNIFICANT CORRELATIONS OF THE PERSONALITY 


VARIABLES WITH THE FACTOR LoaDINGs 


For Each FACTOR IN WHICH EirHER GPY Score, SIV CONFORMITY, OAS, ок 
SIV INDEPENDENCE WAS SELECTED BY MULTIPLE REGRESSION ‘TECHNIQUE 


Personality variable ri 


Factor V (GPY score predictive 
of factor loadings) 


Rokeach Dogmatism 


Scale .63 
Barron-Welsh Art Scale —.74 
Parents’ income 81 
Father’s education .66 
Mother's education 22 


Father's occupation 
SIV recognition 67 
SIV independence —75 


SIV benevolence —.6+ 
Nature of man as 

constructive .64 

Factor VIII 

Form B2, Anxiety Test 94 
Form F, Anxiety Test .83 
Number of brothers 

and sisters .90 
Number of older 

brothers and sisters 96 
Father's age :95 
SIV recognition 81 
SIV independence —.78 
GPY scores 2 —71 
POI feeling reactivity 76 
POI spontaneity m 
POI synergy — 

Factor X 

Rokeach Dogmatism А 

Scale — 
SIV benevolence m 
SIV leadership 790 
GPY scores . 
Other vs. inner ET 

directedness , p 
POI existentiality | " 50 
POI fecling reactivity 4 
POI self-acceptance E 
POI acceptance of 75 

aggression * 
POI capacity for 76 


intimate contact i 
p II (SIV conformity pre ic- 
menan T actor loadings) 


Rokeach Dogmatism 63 
Scale 56 


Form Bl, Anxiety Test 


Personality variable r 
Factor III (continued) 
Form B2, Anxiety Test “7 
Form F, Anxiety Test 37 
Barron-Welsh Art 
Scale —45 
Church attendance —.64 
Mother's age 49 
Parents’ income 45 
Father's education +53 
Mother’s education 53 
Other vs. inner 
directedness —84 
Time competence 45 
Other directedness —.82 
Inner directedness 85 
POI existentiality 87 
POI feeling reactivity .51 
POI self-acceptance 87 


Nature of man as 


constructive 49 
POI synergy 7 
POI acceptance of 

aggression 78 
POI capacity for 

intimate contact .62 

Factor IV 
Rokeach Dogmatism 

Scale —.66 
Age of subject —.53 
Year in college —.58 
Number of brothers 

and sisters —49 
Number of older 

brothers and sisters —49 
Father's age —.73 
Mother's age —.50 
Parents' income —72 
Father's education —78 
Mother’s education —M 
Father's occupation 6% 
SIV conformity 67 
SIV recognition SA 
SIV independence —73 
GPY score 72 
Time incompetence/ 

Time competence —.86 
Time incompetence —78 
Time competence .85 
Inner directedness —.60 
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f 
TABLE 1 (coztinucd) | 
Personality variable r Personality variable r "| 
Factor IV (continued) Factor XV (continued) 
Self-actualizing value 50 SIV conformity 88 
POI existentiality 49 Self-actualizing value —.65 ^ 
POI spontaneity 46 POI existentiality 75 
POI self-regard +85 POI acceptance of 
POI self-acceptance 71 aggression —.68 
Nature of man as 
constructive 46 Factor I (SIV independence predic- 
КО raced { 75 tive of factor loadings) 
capacity for м 3 
inti - Form B1, Anxiety Test —.$3 
intimate contact 47 Form B2, Anxiety Test =n 
я Form F, Anxiety Test — 76 
Factor XI Sex of subject —.51 
Rokeach Dogmatism Year in college 49 j 
Scale 50 Mother's age 39 ^ 
Form F, Anxiety Test 46 Parents’ income i 
Barron-Welsh Art Father's occupation —36 
Scale —.52 SIV support —77 
Year in college 66 SIV recognition —6t 
Church attendance —49 SIV independence 52 
Number of brothers SIV leadership 6% 
and sisters —.89 Other vs. inner 
Father's age .58 directedness —.51 
Mother’s age 62 Time incompetence —4 
Parents’ income .55 Time competence 46 
SIV support d Other directedness —.61 
SIV conformity 43 Inner directedness .69 
SIV recognition =й Self-actualizing value 70 
SIV benevolence .56 POI existentiality 42 
SIV leadership —.53 POI spontaneity 46 
Grade point average 2—46 POI self-regard 64 
Time incompetence/ POI self-acceptance 152 
'Time competence Si POI synergy J+ 
Other «s. inner POI acceptance of 
directedness -— aggression .65 
Time competence —.55 POI capacity for 7i 
Other directedness —.69 intimate contact 78% 
POI existentiality .57 
POI feeling reactivity 72 Factor XIV 
POI spontaneity .51 Rokeach Dogmatism 
POI self-acceptance .50 Scale 61 
Nature of man as Sex of subject —.80 
constructive —,54 Year in college —.81 
POI synergy AZ Number of brothers 
POI acceptance of and sisters .58 
aggression 48 Mother's age 77 
SIV independence —177 
Factor ХУ SIV leadership 75 y 
Self-actualizing value —.77 
Year in college —.71 POI feeling reactivity —45 
Church attendance ‚77 Nature of man as 
Father's age . -71 constructive —65 
Father's occupation —J4 POI synergy —75 4 


at 
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TABLE 1 (continued) 


Personality variable 7 э Personality variable r 
Factor 11 (OAS predictive of Factor IX us m 
factor loadings) OAS т 
OAS А 94 Rokeach Dogmatism 
Rokeach Dogmatism Scale 56 
Scale 69 Barron-Welsh Art ` 
Form F, Anxiety Test .58 Scale = 
5 .54 
Age of subject .65 Sex of subject A8 
Sex of subject —.86 Year in college 56 
Church attendance —.60 Number of brothers 
Number of brothers " and sisters —7$ 
and sisters 46 Number of older 
Parents’ income —.82 brothers and sisters —52 
Father's education —.70 Parents’ income = (59 
Father’s occupation 73 SIV conformity mabe 
SIV recognition .66 SIV benevolence A8 
SIV independence 66 GPY score —63 
SIV benevolence —.8+ Grade point average "m 
Grade point average —.61 Time incompetence/ 
Other vs, inner js NS competence 63 
directedness = ler vs. inner 
Other directedness —58 P edens —89 
Inner directedness Es Dues pieces n 72 
Self-actualizing value 77 ees тейеш —.88 
POI existentiality — .59 POI poca ‘a .85 
POI feeling reactivity 87 РОЇ рнк pm 63 
POI spontaneity 53 POI spo qu аа .69 
шее шопа 59 POI self-regard x 
constructive Li u “ 
e = ^ Ro A 
POI acceptance of m up rS о , 
aggression r POI canacity 64 
B 4 pacity for 
POI capacity for 72 intimate contact 61 


intimate contact 


‘The paper-penci 
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and doing w 


was selected by the multiple re 
of four factors—III, IV, X 


ulations closely 


] measure of confo 
asure of “doin 
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rmity, SIV conformity, has been defined 
g what is socially correct, following reg- 
dered as acceptable and proper." It 
nting for some of the variance 
nd XV. Individuals (N = 17) grouped in 


d, rather flexible, and acceptant of their feelings. 
It was a difficult factor to interpret, since SIV conformity was correlated 
very low with the factor loadings. Factor IV seemed to represent individuals 
(N — 14) who had high SIV and group ——— 'They were 
relatively dependent, seekers of recognition, 18 socially oriented. They ap- 

t both their strengths and weaknesses. 'They were from lower 
peared to accep thers and sisters, and tended to be younger 


d fewer bro Pp re 
amie Factor IX contained individuals (N = 14) for 


Factor III were inner-directe 
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whom the relationship between SIV conformity and factor loadings was 
moderate and positive. They were inner-directed, somewhat older, firstborn 
or only children, and expressive of their feelings. Factor XV contained indi- 
viduals (N = 7) who demonstrated high conformity to societal demands. 
They came from families from a high socioeconomic level, had high church 
attendance, and appeared to have good social judgment in the application of 
values and principles, 

There were two factors in which SIV Independence was predictive of factor 
loadings—I and XIV, Factor I contained individuals (N — 26) with high 
need for independence, low anxiety, and low need for support or recognition 
from others. On the other hand, they valued leadership highly and tended to 
have high self-regard. Factor XIV contained individuals (N — 10) who 
claimed to have a low need for independence and a high need for leadership. 
They tended to be close-minded, nonflexible, opinionated, and inhibited. Also, 
most of this group were females, 

Overall Agreement Scale scores played a major role in two factors—II and 
IX. In the group which Factor II characterized were individuals (№ = 16), 
Primarily females, who, though they tended to score highly on “tendency to 
agree” (OAS), professed a need for independence. They had a tendency 
to be close-minded and conservative. On the other hand, they appeared to have 
a high capacity for making intimate, meaningful relationships and for expres- 
sing their feelings, Individuals (N = 13) grouped in Factor 1X had low 
OAS scores (i.e., low tendency to agree) and high ability to withstand group 
pressure. They appeared to be inner-directed, to assess reality realistically, 
and to be spontaneous and expressive of their feelings. They also tended to have 
fewer brothers and sisters, 

The results indicate that conformity is complex and multifaceted. Appar- 
ently there is no typical, conforming personality per se. Further research should 
ascertain how each of these types of individuals would behave across a variety 
of situations requiring varying degrees of conforming behavior. 


D. SUMMARY 

Four conformity measures (Survey of Interpersonal Values: Conformity 
and Independence subscales, Overall-Agreement Scale, and a yielding score 
for a Crutchfield-type group Pressure situation) and 34 other personality 
measures were administered to 119 Ss, An obverse factor analysis was com- 
pleted with the use of the principal axis method with a normal varimax rota- 
tion. Sixteen factors which accounted for 79 percent of the variance were 
retained. "Types of conforming individuals represented by the factors were 
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identified by performing a multiple regression analysis for each factor and by 
using the Ss’ factor loadings as the dependent variable and the 38 measures 
as independent variables. Eleven factors which were related to at least one of 
the conformity measures were interpreted. 


10. 
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BIOGRAPHICAL INVENTORIES AND STYLE OF LEADERSHIP* 


Department of Psychology, University of Windsor, Canada 


Joun J. La Сара! 


A. INTRODUCTION 

A common practice in developing biographical inventories is to select items 
that discriminate against a criterion with little regard given to why an item 
works or does not work. Owens (5) cautions that a blind empirical approach 
is less likely to result in an effective instrument than one based on a theoretical 
rationale. Williams (9) found that items derived from specific hypotheses 
were more discriminating than items not based on any rationale. 

For many jobs, no rationale exists for the development of items. This is not 
area of leadership. Though research has demonstrated the effects of 
leadership style on group productivity and morale, little of this has been ap- 
plied to the selection of leaders. Fielder (1) questions the selection of leaders 
according to their preferred style of leadership. Fielder points out that there 
is no best style—it depends on the situation. Subgroup analysis, however, may 
be one solution to this problem. 

The main interest in this study is the development of a biographical 
inventory derived from research indicating that the same style of leadership 
is differentially evaluated at different positions in a hierarchy. Moore (4) 
found in his factor analysis of ratings of Air Force NCOs that subordinates 
described NCOs in terms of human relations and superiors described NCOs 
ability to “get the work done.” The present author hypoth- 
andidates who prefer task-oriented to human-relations- 
e successful leaders. 


so in the 


in terms of their 
esized that officer c 
oriented NCOs are more likely to b 

B. MrrHOD 


1. Subjects 
rs at the U.S. Navy Officer Candidate School. 


The Ss were 493 junior office 
graduates recruited from civilian 


These line officers were recent college 


sources. 
* Rece n 
and published imme 
rnal Press. . i 
The To bn of this paper was read before the Southeastern Psychological Associa- 
tion, Roanoke Virginia, April, 1968. The opinions and conclusions contained herein 
are those of the author and are not to be construed as representing those of the De- 
t of the Navy. 


Office, Provincetown, Massachusetts, on March 7, 1969, 
Street, Worcester, Massachusetts. Copyright by 


ed in the Editorial 
diately at 35 New 


partmen 


109 


110 JOURNAL OF PSYCHOLOGY 


2. Material 


a. Biographical Information Blank (Атту BIB). This 99-item scale 
contains life history, demographic, and attitude items. The scale is empirically 
keyed. 

b. Risk Scale. This 73-item scale is a revision of a BIB developed by 
Torrance and Ziller (7). A high score indicates an active and aggressive 
person who develops a male role early in life. 

с. Cooperation Scale. This scale contains 28 forced-choice tetrads (3). 
The examinee is asked to select the two statements that are “most descriptive" 
and "second most descriptive" of himself, 

d. Leadership Orientation Scale. This 56-item scale was derived from the 
Air Force study cited above on differential leadership expectations (4)- 
Statements describing NCO characteristics valued by superiors were paired 
with descriptions of NCO characteristics valued by subordinates. The ex- 
amince is asked to select the statements which best describe the petty officer 
that he, as an officer, would prefer to have in charge of the enlisted men. 

е. Peer Ratings. A measure of each officer's "aptitude for leadership and 
military service” was obtained during the 11th week at the Officer Candidate 
School. Each officer rates all the men in his section. Each man’s peer rating 


overall index of performance for each officer was obtained by averaging his 
scores on the following graphic rating scales: Performance, Desirability, 
Professional Knowledge, Cooperation, Judgment, Promotion Potential, Lead- 
ership, and Overall Rating. 

9. Junior Officer Activity Checklist (JOAC). This 20-item checklist was 
used to evaluate officers in the Fleet. This experimental checklist describes 
critical elements of job performance required of officers aboard destroyer-type 
vessels, Each definition of а critical requirement is followed by two illustrative 
specific incidents, The reporting officer indicated how often in the preceding 
six months he had observed the behavior described in the item, The areas of 
performance included responsibleness, discipline, coordination, consideration, 
communications, training, and specific job duties. A reliability estimate of .79 
was obtained by the Spearman-Brown prophecy formula, 


3. Procedure 


The experimental biographical inventories and other psychological tests 
were administered to the officer candidates shortly after they entered the Of- 
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ficer Candidate School. The officers were followed up to their duty stations a 
year later at which time criterion data were collected. To increase homogene- 
ity with regard to the job performance being evaluated, the data were ana- 
lyzed separately for the 264 officers assigned to Fleet duty and for the 229 
officers assigned to Shore duty. 

Product moment correlations were computed between the scores on each 
biographical inventory and the two criterion measures. With the exception 
of the Army BIB, which predicted performance of officers in Shore billets 
(r = .22; р < .01) as measured by the Fitness Report, none of the experi- 
mental inventories yielded validities higher than .16. The four inventories 
were subjected to item analysis in the hope of raising the validity coefficients. 

The officers in each group were randomly assigned to item analysis and 
cross-validation samples. The item analysis sample contained 129 officers in 
Shore billets and 164 officers in Fleet billets. The cross-validation samples 
contained 100 officers in Shore billets and 100 officers in Fleet billets. Items 
were selected that differentiated high from low criterions beyond the .20 level 
of confidence. It was found that very few items discriminated both in the 
Shore and Fleet samples. "Therefore, separate scoring keys were developed for 
each sample. These new keys were used to rescore the answer sheets of officers 
in the cross-validation samples. Then, product moment correlations were 
again computed between the total scores yielded by the revised keys and the 
official criterion measure. 

C. RESULTS 

The validities of two of the four biographical inventories were not signifi- 

cant in the Shore sample. The validity of the Army BIB did not hold up 


under cross-validation (r = .05; Р > .05). The validity of the Risk Scale 
did not increase as a result of selecting the most discriminating items (r = 


12; р > 105), 
The Leadership Orientation Scale correlated .21 (№ < .05), and the Co- 
operation Scale correlated .30 (5 < 01) with the Officer Fitness Report. 


The correlation between these two predictors was —.01. Adding the scores 
into one composite score yielded a correlation of .37 (р < .01) with the cri- 
terion. The practical value of this validity coefficient was evident when it 
was compared with the predictive value of other measures. With the exception 
of OCS Peer Ratings (r = .26; Р < .01), nothing in the battery of experi- 
mental and traditional aptitude and achievement tests, college grades, train- 
ing school grades, and interview ratings had any value for predicting officer 


success. 
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None of the biographical inventories predicted officer success in Fleet bil- 
lets as measured by the Fitness Report and the JOAC. The Leadership Ori- 
entation Scale had the highest validity against the JOAC (r = .16; p > .05). 
Peer Ratings, however, predicted the Fitness Report (r = .25; p < .05) and 
the JOAC (r = .29; p < 01). 


D. Discussion 


The results support the hypothesis underlying the construction of the Lead- 
ership Orientation Scale for the Shore based officers. Preferred leadership 
style was associated with later success in a leadership role as measured by the 
official criterion measure. Officer candidates who preferred task-oriented lead- 
ership to human-relations-oriented leadership were themselves rated higher on 
the job as officers. 


The finding that “human relations” officers do not necessarily perceive 
themselves as cooperative was rather surprising. Since both measures predicted 
the criterion, the combined measures yielded a higher level of prediction. The 
results suggest that an officer with a low self-image of cooperativeness who 
emphasizes “getting the job done" is less likely to receive as high a leadership 
rating as an officer who is both cooperative and task-oriented. Perhaps, group 
productivity was lower under the task-oriented, noncooperative officer. This 
possibility would be consonant with the finding by Katz, Maccoby, and Morse 
(2) that the production-centered supervisors achieved a lower level of pro- 
duction than employee-centered supervisors. The value of combining measures 
of leadership style and a personality dimension is also consistent with Pelz 
(6) who suggests that it is an oversimplification to try to relate employce- 
oriented versus task-oriented behavior to productivity. The writer suspects 
that task-oriented officers varying in cooperativeness may differ both in leader- 
ship practices and in the way they are perceived by their subordinates. 
| The results of this study support the value of subgroup analysis in validat- 
ing biographical inventories. None of the biographical inventories predicted 
success of officers assigned to Fleet billets. It may well be that different per- 
formance factors, personal qualities, and rater perceptions are involved in 
Shore billets as compared to Fleet billets. The fact that different kinds of 
items discriminated high from low criterion officers in the item analysis sug- 
gests such a possibility. This Possibility is also supported by differences in the 
prediction of success in Shore and Fleet billets by the Cooperation Scale. The 
difference between rs of .30 and —.08 was significant beyond the .01 level 
of confidence. Research in non-Naval settings has also turned up evidence on 
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this point; Vanasck (8) found that the personal characteristics required for 
success as a leader depend in part on the type of duty assignment. 

A more parsimonious explanation for the differences in prediction in the 
two situations might be made in terms of the small samples and the known 
low reliability of biographical type items. Further, the fact that OCS Peer 
Ratings predicted equally well the performance evaluations of officers in Shore 
and Fleet billets suggests that there may be elements in common to these two 
duty assignments. 

Such findings that a biographical inventory is valid in one situation and 
not in another has implications for further research. A job analysis may be 
needed to identify the situational requirements for effective leadership in the 
nonpredicted group. A job analysis also may help to develop a rationale for 
item selection and a priori scoring of new biographical inventories. 

There is a need to continue to conduct research on the selection and assign- 
ment of leaders according to their preferred style of leadership. The style of 
leadership that an individual implements is not something he can turn on and 
off. It is likely to reflect various personality patterns and values. Furthermore, 
a leader is more likely to be effective following a style in which he feels most 


comfortable. 
E. SuMMARY 


The best predictions of success as an officer on the job were obtained with 


self-descriptive biographical inventories measuring cooperativeness and a ten- 
dency to prefer a leadership style that "gets things done" rather than "human 


relations." 
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UNTIMED NORMS FOR THE SRA MECHANICAL 
KNOWLEDGE AND SHOP ARITHMETIC TESTS* 


Student Counseling Center, University of Wisconsin; and Psychology and 
Student Counseling Service, Iowa State University 


Kentner V. Frirz AND Martin Е. Fritz 


A. INTRODUCTION 


Under commercial conditions it is frequently desirable to administer tests 
promptly to individuals as they make applications for jobs. When time-limit 
tests are administered, there is not only a chance of error in timing but the 
clerical worker administering the tests, because of divided attention, cannot 
work efficiently at some other task. Moreover, a time-limitation when taken 
seriously by the applicant must often result in a certain amount of anxiety 
and unhappiness as indicated by the frequently heard remark, “I think I could 
have done better if I would have had a little more time!” And finally, from 
the standpoint of the test-user, it may be desirable to have every candidate 
show what he can do on a total body of content. Speed may not always be the 


primary consideration. 
B. PROCEDURE AND RESULTS 


We are presenting the test results of unselected, noncollege men, 80 per- 
cent high school graduates, who made application for various jobs requiring 
some degree of mechanical interest and knowledge such as truck driver, heavy 
equipment operator, equipment clerk, meter repairman, firefighter, power 
plant maintenance man, etc. All had been given the Otis Quick-Scoring 
Mental Ability Test, Gamma (OG-IQ), 30 minutes; and the Science Re- 
search Associates (SRA) Mechanical Aptitudes test, Form AH (1), Mechan- 
ical Knowledge (MK) and Shop Arithmetic (SA) but not the Space Rela- 
tions, administered without time limit. Although the authors designed the 
SRA test battery primarily for prediction of training potential, this was not 
the objective of the test administration in this study. The purpose was to get 
an estimate of the present state of knowledge of each applicant with respect 
to commonly encountered tools and mechanical devices, and competence in 


simple applied arithmetic. The tests may be considered as having good content 

* i through John A. Antoinetti of the Editorial Board; received in the Ed- 
e p reir Massachusetts, on March 3, 1969; and published imme- 
diately at 35 New Street, Worcester, Massachusetts. Copyright by The Journal Press. 
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TABLE 1 
AGE DISTRIBUTION ОЕ MEN FoR THE MECHANICAL KNOWLEDGE 
AND SHOP ARITHMETIC TESTS 


MK 
Age N=271 " 
18-20 26 
21-25 89 
26-30 49 
31-35 27 
36-40 17 
41-45 21 
46-50 21 
51-55 12 
56-60 9 


validity for this purpose, on the basis of which selections may be made for 
employment. 

The age distributions of men who took the MK and the SA tests are shown 
in Table 1. The OG-IQ distributions are shown in Table 2. 

Although the distribution for MK, Table 3, shows a considerable skew 
toward higher scores, this test is of value in making a judgment as to whether 
or not an individual has acquired the usual fund of information possessed by 
others concerning general mechanical situations. In practice, it has been found 
that those making the lower scores on this test may profess an interest in 
mechanical situations, but have not actually done much about it. The SA 
test, Table 4, resulted in a more balanced distribution. 

When the raw scores on the two tests were plotted against age and the 

TABLE 2 


Oris GAMMA 10 DISTRIBUTION ОЕ MEN FOR THE MECHANICAL KNOWLEDGE 
AND SHOP ARITHMETIC TESTS 


IQ N=271 N iv 
131-135 8 4 
126-130 17 10 
121-125 19 10 
116-120 29 16 
111-115 19 13 
106-110 31 28 
101-105 33 21 

96-100 37 30 
91-95 29 20 
86-90 15 9 
81-85 12 3 
76-80 11 5 
71-75 9 2 
66-70 0 1 
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OG-IQs, it was found that the distributions deviated markedly from nor- 
mality. For this reason it was deemed advisable not to calculate Pearson 
Product-Moment correlations but to make use of the Rank Differences 
method. The MK compared with age and OG-/Q gave Rho values of .09 
and .31, respectively. The equivalent values for SA are .07 and .67. From 


ABLE 3 
MECHANICAL KNOWLEDGE: SCORE, FREQUENCY, AND PERCENTILE DISTRIBUTION 
(М = 271) 

$соге Í Percentile 
45 4 99 
44 7 98 
43 15 96 
42 20 90 
41 35 83 
40 25 70 
39 28 61 
38 24 51 
37 14 42 
36 19 37 
35 17 30 
34 11 23 
33 9 19 
32 + 16 
31 s. n 
30 4 13 
29 4 11 
28 5 10 
27 6 ы 
26 2 $ 
25 : і 

3 

5 2 ; 
2 1 2 
15 : 1 
10 5 1 
1 1 


0 | " — 


d that the OG-IQ is related to the two tests, but to a 
£ MK. The moderately strong Rho value in 
t, in spite of the SA being a pictorial test and 
a rather strong element of what is 


this it may be inferre 
much less extent in the case 0 
the case of SA may indicate tha spit 
supposedly so applied, there still is in 1t 


oft refer o as mental а bility. 

en refe red to а ntal a 

An increase 1n scores obtained by time-limit testing is reported by SRA f 
Ў t 


both MK and SA from the ninth grade through the 12th. In this study, 
testicals йб relation with age Was found even though a cumulative effect 
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of experience would be expected. Although the explanation might be given 
that high-scoring older men, having long before found stable employment, 
were not well represented in the samples, the tests still constitute a face-valid 


method of handling the practical problem of employee selection, hopefully to 
be verified by later research. 


TABLE 4 
SHOP ARITHMETIC Score, FREQUENCY, AND PERCENTILE DISTRIBUTION 
(N = 177) 
Score f Percentile 
24 1 99 
23 5 99 
22 5 97 
21 5 94 
20 10 91 
19 8 85 
18 9 81 
17 12 76 
16 11 69 
15 14 63 
14 19 55 
13 12 44 
12 10 37 
11 11 32 
10 16 25 
9 6 14 
8 6 13 
vd 9 10 
6 2 5 
5 2 3 
4 4 2 
REFERENCE 
1. SRA MECHANICAL APTITUDES. Grades 9-12 and adults, Chicago, Ill.: Science Re- 
search Associates, 1947, 1950. 
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THE ROLE OF VERBAL FACTOR IN А FORCED-CHOICE 
TEST OF ACHIEVEMENT VALUE* B 


Department of Psychology, York University, Canada 


Bisuwa МАТН MUKHERJEE 


A. INTRODUCTION 


Although it has been observed by many that the verbal ability of the S 
has a profound impact upon how he responds in a personality questionnaire, 
have been reported in this connection. Seigman 


only a few empirical studies 
e correlation between scores on the vo- 


(14) found, for example, a negativ 
cabulary test of the Shipley-Hartford Retreat Scale and the California F 


Scale. The correlation was found to be —.38, significant at .05 level. Mes- 
sick and Kogan (6) in a factor-analytic study also isolated in their male 
sample a verbal comprehension factor marked at one end by a vocabulary 
test with a loading of .71 and the other end by a modified F Scale with a 


loading of —.45. 

With Taylor Manifest Anxiety Scale (MAS), the results have been some- 
what contradictory. Calvin, Koons, Bingham, and Fink (1) reported a sig- 
nificant negative (—.29) correlation between MAS and the Vocabulary sub- 
test of Wechsler Bellevue Intelligence Test. Kerrick (4) also found that 
scores on Word Knowledge Test of AAF battery correlated .40 with MAS 
128 Air Force trainees. Matrazzo, Ulett, Guze, and Saslow 
(5), however, had shown that composite scores derived from the Compre- 
hension, Vocabulary, and Similarity subtests of the Wechsler Bellevue Scale, 
given in group administration to 101 college sophomores, correlated —.07 
only with the MAS scores. Mukherjee (8) reported a correlation of .004 
and .065 with MAS scores on the one hand and the Word Ending and Word 
Beginning Tests, respectively. It appears then that scores on the MAS may 


be relatively more independent of both verbal comprehension and verbal flu- 


ency factors than the California F Scale. 


in a sample of 


* Received in the Editorial Office, Provincetown, Massachusetts, on March 6, 1969, 
and published immediately at 35 New Street, Worcester, Massachusetts. Copyright by 

The Journal Press. 
1 The research reported herein was supporte! 
Education, United Stat 


Program of the Office of i 
"s period of stay at 
119 


d through the Cooperative Research 
es Department of Health, Education, 
Indiana University. 


and Welfare during the author 


120 JOURNAL OF PSYCHOLOGY 


In the area of questionnaire measurement of achievement-related needs 
(v Ach), not a single study has been reported so far which deals with the 
role of verbal factors. The present study was taken up for assessing the im- 
portance of two verbal factors in determining the v Ach scores of American 
college students. Specifically, the object of the study was to determine the 
extent poor v Ach scores are due to vocabulary limitations on the part of 
respondents. 


B. METHOD 
1. Subjects 


The Ss were 87 freshmen enrolled in an introductory course offered during 
fall 1963 semester at the Southeastern Campus of Indiana University, Jef- 
fersonville, Indiana. A detailed description of the sample appears elsewhere 


(9). 
2. Tests 


Two measures of verbal ability—namely, vocabulary and completion— 
were taken from the original 57 tests of Primary Mental Abilities (PMA) 
used by Thurstone (15) and published by him as a supplement (16). In 
addition to these tests, Thurstone and Thurstone’s (17) Suffixes and Pre- 
fixes as adapted recently by Educational Testing Service, Princeton, New 
Jersey, under the titles Word Endings and Word Beginnings, respectively 
(3) were used as predictors. These two tests are supposed to measure the 
word fluency factor (3), since both require the Ss to produce rapidly words 
that fulfill certain requirements, 

For measuring achievement values (v Ach), the Sentence Completion Test 
(SCT) developed by Mukherjee (7, 8) was used. The SCT consists of 50 
forced-choice triads in which items having comparable social desirability val- 
ues based on the ratings of American college students samples are arranged 
in groups of three so that each member of the triad represents a different psy- 
chological need. Each triad, however, has an item the endorsement of which 
reflects achievement orientation. The total number of such achievement re- 
lated items endorsed by the § operationally defines his v Ach. A high score on 
the SCT is interpreted as a keen desire to compete successfully with a stan- 
dard of excellence, an expressed interest in undertaking difficult and challeng- 
ing tasks, and a strong sense of optimism. Details concerning the construction, 
administration, scoring procedure, and various psychometric properties of SCT 
appear elsewhere (7, 8). The Kuder-Richardson reliability estimate for SCT 
has been found to be .716 for a sample of 248 students enrolled in Indiana 
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University. The test retest reliability of the SCT after an interval of nearly 
two months has been found in various studies to range from .71 to .83. Stud- 
ies demonstrating the concurrent and construct validities of SCT as a mea- 
sure of achievement value have been reported elsewhere (8, 10, 11, 12, 13). 


3. Procedure 


Mimeographed copies of SCT were distributed to all the Ss and they were 
urged to fill out and return the test booklet and the completed answer sheet 
within a week. All the 5s filled out the questionnaire in their leisure time in 
псе with the instructions contained on the first page of SCT booklet, 
and turned in the material within three days. In addition to SCT, each mem- 
ber took a large number of psychological tests including 20 tests from Thur- 
stone's PMA, the Word Beginning, and Word Ending Tests and several other 
ability tests. These tests were administered during the regular class hours be- 
mber 13, 1963 and December 17, 1963. These tests including the 
the present study were administered in groups of 
f a larger project on learning and structure 
himself, one research assistant was 
all tests. Since the whole 
ticipant will be 


accordar 


tween Nove 
four verbal tests used for 
nearly 40 Ss at a time, as a part o 
of abilities. In addition to the investigator 
responsible for the administration and proctoring of 
ake the tests on the condition that each par 

{ an optimal motivation on the part 


idence o 
administered at a time, and in each case 


between two tests. The testing sessions 
ided optimal conditions for the taking 


group agreed to t 
ater on, there was ev 
an four tests were 
five-minute rest 
hich prov 


counseled 1 
of Ss. Not more th: 
there was at least a 
were held in the classroom wW 
of psychological tests. 

C. RESULTS 
ercorrelations among the verbal tests, as well as 


SCT, are reported in Table 1 along with the mean and standard deviation of 
cach variable. Multiple correlation between all the four verbal tests on the 
one hand and the SCT on the other hand was also computed. The multiple 
turned out to be 26; and, thus, the hypothesis of zero correlation 
rs and SCT could not be rejected. 


The product moment int! 


correlation 
between the verbal predicto 


D. Discussion 
per indicate clearly that v Ach scores on the 
he verbal fluency nor by the vocabulary lim- 


t appears that the level of vocabulary 
sample of 


The results reported in this pa 


SCT are neither determined by t 


itations on the part of the respondents. I a й 
used in making the items of SCT was relatively easier for the 
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American college freshmen studied. However, it should not be interpreted 
that the items of SCT would have necessarily the same meaning for all Ss 
who took the test. This may or may not be true. The assumption of an iden- 
tical meaning of a particular item across the population of Ss is not partic- 
ularly involved in the construction of SCT. 

Since “the word fluency factor W is involved whenever the subject is asked 
to think of isolated words at a rapid rate" (17, p. 110), as is called for in a 


TABLE 1 
PRobUcr MOMENT INTERCORRELATIONS! AMONG SELECTED VERBAL TESTS ОЕ PMA AND 
SENTENCE COMPLETION TEST ALONG WITH THE MEAN AND 
STANDARD DEVIATION OF EACH VARIABLE 


(№ = 87) E : 
Я Word Word 
Variable SCT Completion Vocabulary Ending Beginning 
Sent. Completion EBEN === 5 
est М 1.000 .200 472 .007 —.066 
Completion .200 1.000 :545 .269 .287 
Vocabulary 172 545 1.000 .339 317 
Word Endings 007 .269 .339 1.000 548 
Word Beginnings —.066 .287 517 .548 1.000 
Меап 15.46 21.82 40.63 28.43 23.01 
Standard 
deviation 4.88 5.88 11.51 7.62 6.05 


* In order to be significant at .05 level, the required value is 21. 


test of rhyming, or in reproducing words with a given letter, prefix, or suffix, 
it is no wonder that SCT scores showed almost a zero correlation with scores 
on Word Beginning, as well as Word Ending Tests, 

It is of supplementary interest to note the positive but quite low correla- 
tions between scores on Vocabulary Test and the two measures of word flu- 
ency used in the present study, The results indicate that, although the Word 


Beginning and Word Ending Tests measure some kind of familiarity with a 
wide range of words, it is not the 5: ki 


elicited by measures of verb 
PMA Vocabulary Test, The disti 


d ¢ А two verbal factors 
a. noted from time t, -analytic studies (2, p. 
198). 


The results reported here appear to indicate that the y Ach scores of SCT 
have little verbal saturation jn the population of American college freshmen. 
The notion that 8s having high achievement values will tend to have a more 
enriched vocabulary than gs with low v Ach scores is not supported by the 
data of the present study, The Study also shows that achievement orientation 


En 
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of American college freshmen cannot be predicted if we know about their 
strength and weakness in the verbal domain. To what extent such a general- 
ization will be true for other types of populations merits further study. 


E. SUMMARY 


In order to investigate if scores on Sentence Completion Test (SCT), a 


forced-choice scale of achievement value (v Ach), are dependent upon verbal 
fluency and vocabulary, 87 freshmen studying at Indiana University, Jeffer- 
sonville, were given the Word Beginning, Word Ending, and Vocabulary 
Tests after they took SCT. The multiple correlation between scores on SCT 
significant .26. It appears that Ss’ 


and verbal tests turned out to be a non 
verbal fluency and vocabulary play a negligible role in determining their 


scores in SCT. 
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AN INVESTIGATION OF SUCCESS PREDICTORS IN 
GRADUATE SCHOOL IN PSYCHOLOGY* 


Institute of Behavioral Research, Texas Christian University 


James К. RawLs, DONNA J. RawLs, AND C. WADE HARRISON 


A. INTRODUCTION 


In selecting students for graduate work in psychology, most departments 
usually use some combination of GRE or MAT scores, undergraduate grade 
and letters of recommendation. Yet, from a practical standpoint, 


point average, 
ly weak, accounting for only a small proportion 


these measures are predictive 
of the variance in the criterion measures (1). 

Obviously, one of the reasons for the failure of these predictive measures to 
demonstrate later discriminative efficiency is the very fact that they are used 
in the selection proces In effect, by eliminating students who were less suc- 
cessful on the predictive measures, much of the relevance to later criterion 
d. Truncating the distribution in this manner and then 
resultant homogeneous sample could hardly be ex- 
predictive validity coefficients. 

Still another problem is that of adequate criterion measures. Studies to date 
have utilized comprehensive examinations, faculty ratings, student ratings, 
and graduate grade point average; yet, ratings, including those of performance 
on comprehensive exams, have demonstrated far less than optimal reliability. 
Furthermore, even without considering the subjectivity in assigning grades, 
the limited range (usually only A's, B's, and C's) restricts their capacity for 
prediction. Morcover, obtaining а Ph.D. in psychology is dependent upon 
v more than passing an exam or making good grades: e.g., perfor- 
: ations, successful completion of a thesis, and successful 


measures is remove 
making predictions on the 
pected to produce impressive 


considerabl 
mance in oral exanun 


on of a dissertation. 
£ the problems, there is a greater need for predictors of success 


day than ever before in history. There is an increased 
demand, fOr advanced degrees, and there are ever-increasing numbers of 
fill available slots. Therefore, concentrated attention will need 
these problems and finding ways and means 


completi 
Regardless 0 
in graduate training to 


applicants to 
to be focuse 


ee 


d upon resolving 


; itori Б vi vn, Massachu March 7, 1969 
: iv he Editorial Office Provincetown, setts, on March 7, 1969, 
"e Received in mediately at 35 New Street, Worcester, Massachusetts. Copyright 
pu a у 
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whereby we can make better and more sophisticated predictions of success in 
graduate training than we have done in the past. 

The purpose of the present study was to investigate the utility of a number 
of predictor variables, both new and old, in discriminating between three 
groups of students in psychology at Louisiana State University: viz., those 
obtaining Ph.D.’s, those receiving terminal master’s degrees, and those who 
were unable to obtain either degree at L.S.U. 


B. METHOD 
1. Subjects 


The subject Population consisted of 75 students who were enrolled in the 
graduate program in psychology at Louisiana State University between Sep- 
tember, 1961, and August, 1965. Of these, 25 subjects obtained the Ph.D. 
during this time, 25 obtained a terminal master’s degree, and 25 were unable 
to obtain either degree. This latter group was not allowed to continue in the 
Program after their first year because of academic failure. There were 65 


Data collected included MAT and GRE scores, letters of recommendation, 
a few biographical items, and an essay answering the question, Why I want to 
go to graduate school in psychology." In addition, transcript information was 
recorded in order to show the number of hours taken within each subject 
area and the number of hours within each area of A's, B's, C's, etc. Thus, in 


average, grade point averages were calculated 


C. RzsuLTS 
The data were analyzed by 


means of appropriate ?’s and ?'s. As shown in 
Table 1, results of the present . 


study indicated that those 8s who obtained the 
Ph.D. had had more undergraduate hours in psychology, biological sciences, 
and math, and fewer hours in the humanities than either terminal master's Ss 
or Ss not obtaining a graduate degree, Ph.D. Ssh 


ad had more hours in foreign 
languages than M.A. Ss, but not more than the 


no degree Ss, Ph.D. Ss and 
| i taken more undergraduate hours in the natural and 
physical sciences and combined sciences than Ss not obtaining graduate degrees. 
The number of hours in the natural and physical sciences and combined 
sciences were the only two significant differences that were found in the 


entire study between M.A, and no degree $$. 
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Ss who received the Ph.D. had higher grade point averages in humanities, 
math-science courses, and in the combined sciences. Ph.D. Ss had higher grade 
point averages in the natural and physical sciences than no degree Ss, but not 


higher than master’s degree Ss. 


TABLE 1 
SIGNIFICANCE LEVELS OF DISCRIMINATING PREDICTOR VARIABLES 
ы Ph.D.- M.A.- 
Variable Ph.D.-M.A. No degree No degree 

Hours in 

Humanities* < .20 < .20 

Foreign languages «10 

Natural and physical sciences < .20 < 20 

Mathematics < .20 < .20 

Biological sciences < .20 < 0 

Psychology < .05 < .05 

Combined sciences < 120 < 20 
Grade point average in 

Humanities < .20 « 10 

Natural and physical sciences < .05 

Math-science < 20 < .05 

Combined sciences <.20 < 20 
Biographical information 

Age at admission < .01 < .01 

Married at admission < .001 < .001 

Entering with М.А. « .01 <.01 

Large undergraduate university <.10 « 0 

uencies in each case except the hu- 


* Ph.D.'s and М.А had higher means ог freq 
manities. 
With regard to the biographical items, Ss who obtained a Ph.D. as com- 
pared with the other Ss were older when entering the program, were more 
often married upon admission, and more likely entered with a master's degree. 
Ph.D. Ss also came from larger undergraduate institutions than did the Ss in 


the other two groups. 


In an attempt to make some predictive use of letters of recommendation, 


ariety of procedures ranging from counting the number 
and adverbs to having judges rate the letters on various 
as effective in differentiating the three 
tried in order to make some 
"Why I want to go 


use was made of a v 
of positive adjectives 
criteria. None of these procedures W 
groups. Similarly, a number of techniques were 
predictive sense out of the essay answering the question, 


to graduate school in psychology,” but to no avail. А | 
Despite the large number of variables investigated, no other variables dif- 


ferentiated the three groups. Variables which failed to reach significance are 


listed in Table 2. 
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TABLE 2 
VARIABLES FAILING To DISCRIMINATE 


Test data 


MAT 
GRE-Verbal 
GRE-Quantitative 
GRE-Total 
GRE-Psychology 


Hours in 


Art 

Education 

English 

Music 

Philosophy 

Physical education 

Sociology 

Social science 

"Theology 

Combined sciences and math 


Grade point average in 


Art 

Biological science 
Education 
English 
Languages 

Math 

Music 
Philosophy 
Physical education 
Psychology 
Sociology 

Social science 
Theology 


Combined psychology and social science 
Overall 


Location of BA/BS conferring school 
N.-S. 
E.-W. 
S.-not S. 
NE-SE-NW-SW 
Degree upon entrance 


B.S.-B.A, 
M.S.-M.A, 


Undergraduate major 


Psychology-nonpsychology 
Social Science-nonsocial science 


Sex 
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D. Discussion 

Despite a rather large number of variables and an unusually low acceptable 
level of confidence (р < .20), few variables showed promise in their capacity 
to discriminate successful, less successful, and unsuccessful graduate students 
in psychology. As indicated, both the MAT and GRE failed significantly to 
differentiate the three groups. Likewise, letters of recommendation and the 
essay on “Why I want to go to graduate school in psychology” did not dis- 
criminate between the groups. 

The partitioning of transcript information did show some glimmer of use- 
fulness in that Ph.D.’s, as compared with the other Ss, had had more under- 
graduate hours in psychology, foreign languages, biological sciences, math, 
natural and physical sciences, and all sciences combined. They also had higher 
grade point averages than the other Ss in the humanities, natural and physical 
sciences, math-science courses, and combined sciences. 

Biographical information also showed some encouraging predictive value in 
that Ph.D. Ss were older than the other Ss when entering the program, were 
more often married upon admission, and more frequently entered with a 
master’s degree. They also more often obtained their undergraduate degrees 
from a larger university than did the other Ss. 

As indicated by their overall undergraduate point average, MAT and GRE 
accepted into the L.S.U. psychology program were 
1 to their intellectual ability. It would thus ap- 
pear that success in graduate training was dependent upon some factor other 
than sheer brain power: e£» interests, motivation, or personality. But what- 
ever the factor may be, it seems fitting to look for predictors outside the realm 
of intellectual ability so long as students are carefully screened. before enter- 
ing the program. Results of this study indicate that past experience in course 
work areas, and biographical information (age, marital status, etc.) appear 
predictive potential, In light of recent successes in personnel 
with the use of the Biographical Information Blank, the 


and similar devices, biographical data may prove to 


estigation f duate 


scores, graduate students 
very homogeneous with regarc 


to have some 
selection in industry 
Personal History Survey, 


be a very fruitful area of inv or predicting success 1n gra 


training. 
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THE EFFECT OF SAMPLE ITEMS ON TEST PERFORMANCE* 


Institute for Child Study, University of Maryland; and 
Department of Education, Purdue University 


DowALD W. FELKER, GERALD LansoN, FRANK MILHOLLAN, 
AND Joun К. PYPER 


A. INTRODUCTION 


It has been found that questions inserted in filmed or printed instructional 


materials facilitate the learning of the information covered by the inserted 


questions and also the learning of material untested during the study period 


(1, 2, 4, 6, 7). 

It should be noted that 
with inserted questions on 
are seen just before the relev 
to the questions are given, on 
improved (1, б, 7). If the material is quite 
parently have little or no effect (4), as well as if the important information 
in the text is cued by underlining, printing in red, etc. (2, 3). 

Additional research findings suggest that there are conditions under which 
the facilitative effect of inserted questions is selective. Ss exposed to inserted 
questions on specific types of information, such as common and technical 
phrases or measures and names, tended to learn more of the type of informa- 
tion involved in the inserted questions (7). | 

In spite of the conclusions of the above mentioned researchers concerning 
classroom and textbook procedures, Рурег (5) һаз argued that the findings are 
as yet of dubious value for practical application because (a) the learning 
tasks of the research activities have been short instructional experiences which 
are atypical of the learning atmosphere and environment of the classroom, 
(5) the task is not course imbedded and hence raises the question of the Haw- 
thorne effect, п instructional procedure of not allowing the 


and (c) the research т 
Ss to backtrack and reread material is unrepresentative of practically all 
s to ba ack 4 

m instruction. 


the generally facilitative effect on learning associated 
ditions. If the questions 


ly occurs under certain coni 
t after, or if the answers 


ant material is read vs. jus 
ly the ability to answer the inserted questions is 
interesting, inserted questions ap- 


classroo 


€"— 
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B. PnonLEM 


The present study was designed to investigate the effect on student attitude 
and learning of handing out sample test items when the students are pre- 
Paring for a test, This approach imbeds the inserted question procedure in an 
ongoing instructional Program. The need to Prepare for a classroom test should 
serve as motivation for mastery of material. 

The following hypotheses were tested: 

1. Ss given sample items for study prior to a test will correctly answer 
more test questions involving the material covered by the sample items than the 
material not covered by the sample items, 


2. Ss given sample test items for study will perceive the Procedure as being 
helpful in Preparing for the test. 


C. METHOD 


1. Subjects 


Students enrolled in a master’s level course in I 


University of Maryland Were selected for the study, 
dents each Served as subjects. 


«earning Theory” at the 
Two sections of 28 stu- 


2. Procedures 


A midterm test of 112 items covering four chapters in the required textbook 


was prepared, The test Was then given to Section I o£ the course, Item anal- 
ysis data from the responses of the students in Section I was used and two tests 
were compiled, One test consisted of 25 items from chapters 1 and 3, A second 
test consisted of 25 items from chapters 2 and 4. With the use of the item 
analysis data, items on the two tests were matched for difficulty level and 
item discrimination, 

Items not included in the two 25-item matched-tests, and only covering ma- 
terial in chapters 1 and 3, were landed out to Section I1 of the course. These 


items were Presented as sample items to Prepare the students for the test and 
to aid them in studying. One week later the two matched-tests of 25 items 


each were combined into а 50-item test and given to the students in Section II 
as a midterm examination, After the test cach student answered the following 
question : "Did you feel that the test questions distributed to you were helpful? 
Very much. Quite a bit. Not much. Not at all.” 

The differences between the scores on the two Matched tests were com- 
pared in testing the first hypothesis that Ss given sample items for study prior 


to а test will correctly answer more test questions involving the material 
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covered by the sample items than the material not covered by the sample items 


The analysis is presented in Table 1. 


ANALYSIS О! ORES ON A TEST OVER MATERIAL WT 
ANALYsIS OF VARIANCE OF SCORES A M TH SAMPLE 
V T ATERL 
TEST ITEMS AND A TEST OVER MATERIAL WITH No SAMPLE ITEMS 


m Source $$ df MS F 
Between subjects 992.9 27 36.7 
Between tests 18.3 1 183 po" 
Error 79.7 27 2.9 К 
Total 1090.9 55 

* «05. | 


D. RESULTS 


The analysis indicates that the students scored significantly higher on the 
test covering chapters from which sample items were taken (X — 1542) 
than on the test covering chapters from which no items were taken X = 
14.28). The F-ratio for tests was 6.31 with 1 and 27 degrees of freedom, 


^ < 05. 
The performa 
received sample test ite 


ts of test items of the Ss who 
ms and those who did not receive sample items was 


compared. Group one, which had no sample items, had the following pro- 
portions of correct responses on the tests: (a) test over chapters one and three, 
59.68; (b) test over chapters two and four, 59.84; and (c) total for combined 
tests, 59.76. Group two. which had sample items from material in chapters 
one and three, had the following proportions of correct responses on the tests: 
(a) test over chapters one and three, 61.71; (b) test over chapters two and 
four, 57.14; and (c) total for combined tests, 59.42. 

It is apparent that there is no difference in the total proportion of items 
answered correctly for either condition. Thus the rise in the proportion of 
nswers to items which covered the material represented in the sample 
a decrease in the proportion of correct answers to 
presented in the sample items. 
hat Ss given sample test items for 
helpful in preparing for the test, 
cerning the helpfulness 


nce on each of the two se 


correct a 
items was accompanied by 
items which covered information not те 

In order to test the second hypothesis id 
study will perceive the procedure as being 


the distribution of responses to the posttest question соп 
of the sample items was analyzed with the use of chi square. The number of 


students choosing each response was Very Much, 12; Quite a Bit, 10; Not 
Much, 2; and Not at all, zero. Four students did not mark their answer 
э “y © 


sheets. 
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A random distribution for expected frequencies was assumed and the chi 
square equals 17.333, which has a p < .01. 


E. SUMMARY AND Discussion 


The results can be summarized as follows: significantly more students said 
that sample items are helpful than students who said they are not helpful. 
Giving students sample items was associated with higher performance on test 
items covering material from which sample items were taken. Finally, students 
who had sample items did not attain an overall test achievement which was 
any higher than students who had no sample items. 

The generalized yet selective facilitative effect of the sample test items 
is consistent with the finding of Rothkopf and Bisbicos (7). Such findings 
Suggest that sample items result in increased posttest ability on material from 
which the items were taken but decreased posttest ability on material from 
which no items were taken. The findings of the present study would indicate 
that sample items before a test can affect learning, but that concern should 


be given to how the material in the sample items is distributed among the 
total material to be studied. 
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COMPARISON BETWEEN AMERICAN PERMANENT 
RESIDENTS OF ISRAEL: PART П, 
ISRAELI BACKGROUND*? 


B’nai B'rith Hillel Foundation at Purdue University 


GERALD ENGEL 


A. INTRODUCTION 


Native American Jews who became permanent residents of Israel during 
the five-year period 1962 through 1966 provided information in 1967 about 
their American background and adjustment to Israel. Their responses relating 
to life in America were presented in Part I, as a comparison between the 
religiously oriented and the nonreligious.” Included in Part I was an anal- 
ysis of the minority’s reaction to à majority culture, identification with Amer- 
ican Jewish life, and interest in Israel (2). 

Both groups were equally disturbed by materialism in America. To a lesser 
degree the religious and nonreligious were equally troubled by anti-Semitism. 
The nonreligious were more disturbed by conformity and political witch- 
hunting; while the more Jewishly group-identified religious subjects were 
more concerned about the possibility of assimilating. The religiously oriented 


more interested in Israel than were the nonreligious. 
e same religious, nonreligious comparisons as in Part I. 
bjects’ attractions to Israel, feeling upon arrival and 
arison between working conditions in Israel and 
ated to Israel chose to reduce con- 
mmigration). By contrast the 


were also 
Part II maintains th 


Part II deals with su 
afterwards, and their comp 
America. The American Jew who immigr 
flicts between belief and action by aliya (i 

Massachusetts, on March 10, 1969, 
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F'n Israel 255 subj i ified with Jewish religious tradition: 87 traditionalists 
(masorti) Я commandments (shomer mitzvot); 138 observant. (dati). 
One hundred eighty-eight were designated nonreligious in Israel: 27 against bb- 
servance (antidati) ; 80 not observant (lo dati); 55 secularists (chiloni) ; 24 who 
wrote “don’t know”; two Hebrew Christians. 
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American "alienated by his own commitment" (1) who remains in his native 
land generally modifies his own attitude to fit into the system. The indi- 
vidual who goes to live in Isracl is responding to an inner signal to avoid 
increasing dissonance (3). Subjects in conflict about aspects of American life 


and about Israel’s attraction are discussed in terms of an expanded theory of 
cognitive dissonance. 


B. Ѕувјест 


came permanent residents of Israel during the years 
1962 through 1966, who were 20 years of age or older by the end of 1966. 


Seventy-seven subjects were eliminated, including six who said they left 
America as infants, six who were deceased, and 65 who were not in Israel 
from March through August 1967 when this questionnaire was administered. 
Of 703 potential subjects 443 cooperated, representing а 63 percent response. 


C. PROCEDURE 

All subjects were mailed a four-page, 
spondents included 364 who voluntarily answered one of four mass mailings 
and 79 other subjects who were considered forced, responding to personal 
letter or interview, When few statistical differences were observed between 
these groups, the subjects were combined. The comparison presented between 
religiously oriented and nonreligious is based upon subjects’ religious align- 
ment in Israel, and doe reflect their religious outlook when 


in America (2) 3 


self-administering questionnaire, Re- 


S not necessarily 


D. Resutts 


Several hypotheses Were postulated upon the assumption that the reli- 
giously oriented, more Jewishly identified in Americ 


à a, would be more attracted 
to, and experience greater satisfaction living in Isr. 


One assumption was confirmed : 
to a Jewish Israel than уу 


acl than the nonreligious. 
the religiously oriented were more attracted 
ere the nonreligious (Table 1). 

Other conjectures were rejected. "Hate w 
career opportunities attracting settlers CT 
migrants) before settling shared the 
within Isracli income; 66,5 percent ex 


3$ no group difference about 
able 1). Both groups of ойт (im- 
Same degree of optimism about living 
Pected it to be easy. 


3 See footnote 2. 


„Юю чэ - с 


vr vea. 
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TABLE 1 
JsRAEL’S ATTRACTION 
Percentages 
Nonreligious Religious] i 
_ Question (N 188) Cl mm 
Homeland (N 172) (N 235)** 
none 18.0 8.1 
some 32.6 17.0 
much 494 74.9 
Land of Bible (N 166) (N 216)** 
none 304 12.5 
some 44.0 29.6 
much 15.7 57.9 
Jewish life (N 167) (N 230)** 
none 31.1 8.3 
pue 41.5 17.8 
mieh 27.5 73.9 
ive апела, 7248 (N 170) (N 230)** 
anne 24.7 8.7 
324 21.7 
some 42. 9 69.6 
much 2 н 
os N 164) N 205 
Career opportunities ( 59.1 ( 209) 
попе 28.7 29.8 
some 122 9.3 
much (№ 317)** 
Г 
Jewish education WwW E 184 
nonc 234 23.5 
some 34.7 58.1 
much N 169)* (N 203) 
Kibbutz life n 65.5 
none 24.3 16.7 
some 23.1 177 
much = Pagel 
BMD02s— Contingency Table Analysis—Version of 
University of California at 


erived from 


анһ Science ting facility, 


Note: All tables d 
March 11, 1964, He 
Los Angeles. - 

* Chi square significant с. 

** Chi square sign! cant di 

" ; ‘val to settle. Thirty-five per- 
Bevan ee ба Bee of arriva E i y р 

There was no group difference í 25 
cent before аре 25; and 34.3 percent between age 25 and 85: 

cain imilar feeling of ease among both groups of settlers, encom- 

ere was а simile А a vear la , 
passing 74.5 percent UPS” a; d E Team da pe : зенан 
i sponding. Yet, EF pip: deg print 

at time of respo ‘Ks language of the land. Upon arrival 54.5 percent of 
Hebrew, н of the in conversa- 


d nt religious had no ability i 
and 30.7 ме 80.5 percent of the nonreligious and 93.5 
n res 


s Compu 


.05. 


differences — 301. 


fferences — 


94.3 perc 


was and i 


municate in 
the nonreligious 
tional Hebrew- Whe 
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percent of the religious report much ability in conversational Hebrew (chi 
square significance at .001 level). 


The nonreligious working in Israel feel there is more opportunity in their 
fields when comparing Israel to America than do the religious. Nevertheless, 
there is similarity of feeling between groups about income, satisfaction, pres- 
tige, and unemployment in their field, as well as standard of living (‘Table 2). 

Real occupational differences exist with the chi square significant difference 
at 01. More of the nonreligious are professionals (27.7 percent of the non- 
religious compared to 22.0 percent of the religious) and farmers (9.0 percent 
of the nonreligious, 3,1 percent of the religious). 

Among these immigrants 89.5 percent agree that their housing is adequate. 
More than half (57.3 percent) assert living 


The kibbutz was More attractive to the 
(Table 1). Mor 


within their income is easy. 
nonreligious than to the religious 
€ of the nonreligious live in rural arcas (27.7 percent of 


TABLE 2 
ISRAEL COMPARED TO AMERICA 
Percentages 
Nonreligious Religiously oriented 

Question (М 188) (N 255) 
Opportunities in field (N 143)* (N 184) 
ess 48.3 44.0 

same 23.1 37.0 

more 28.7 19.0 
Income for work (N 139) (М 175) 
less 80.6 80.6 

same 12,2 9.7 

more 7.2 9.7 
Satisfaction in work (N 153) (N 193) 
less 19.0 19.7 

same 35.3 32.6 

more 45.8 47.7 

Prestige in work (М 148) (N 179) 
less 25.7 29.6 

same 35.1 31.8 

more 39.2 38.5 

Standard vim (N 172) (N 215) 
76.7 68.8 

same 19.2 22.8 

more 41 $4 
Чекара in my field (N 91) (N 114) 
ew 28.6 21.1 

same 50.5 50.9 

more 20.9 28.1 


* Chi square significant differences — .02. 


H 
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the nonreligious, 16.5 percent of the religious at chi square significant differ- 
ence .005). Of all American olim (immigrants) 21 percent live in rural areas 
compared to 11.6 percent for Israel (5) and between three and four percent 
of Jews in the U.S.A. (6). 

E. Discussion 


oriented American Jews were more attracted to a Jewish 
Israel than were the nonreligious. Nonreligious subjects found kibbutz life 
d the religious. Both groups of immigrants, responding 
to many needs, settled in Israel at the same ages (4). Both groups were 
after arrival, a fact which was evident in their assertions 
ease of living within income, and 


Religiously 
more appealing than di 


equally comfortable 
about fecling at ease, adequacy of housing, 


job satisfactions. 

The religiously oriented were more attracted to a Jewish Israel than the 
nonreligious, though the Jewish attractions of the land were paramount to 
both groups. Despite dissimilarities, the overall view of America and Israel 
created the same total dissonance within both groups. This was reflected in 
similarities of age when settling, and similar feelings of ease upon arrival, 


though differences existed between groups in ability to speak Hebrew. In 
1967, both groups coincided in their views about housing, living within Israeli 
income, and work satisfactions, notwithstanding real differences in occupation, 


and expressed disagreement about occupational prospects. 
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APPROXIMATE RANDOMIZATION TESTS* 


Department of Psychology, University of Calgary 


EUGENE S. EDGINGTON 


A. INTRODUCTION 


In an article on randomization tests, approximate randomization tests were 
briefly discussed (1). This paper will consider approximate randomization 
tests in further detail. First, however, it is necessary to correct a statement 
in the earlier article. In a randomization test, the significance of an obtained 
statistic is the proportion of the statistics in the sampling distribution that 
tistic, not, as was erroncously stated, the pro- 
atistic. The correction is required for the 
under the null hypothesis, 


are as large as the obtained sta 
portion that exceed the obtained st 
computed significance value to be the probability, 
of obtaining such extreme results. 

B. TERMINOLOGY 


The term entire sampling distribution will be used to refer to the sampling 
distribution of a randomization test consisting of all divisions or pairings of 


measurements that are equally probable under the null hypothesis and random 
assignment, or to refer to the distribution of statistics associated with the dis- 
tribution of divisions or pairings. For example, given the null hypothesis of 
no difference between the effects of Treatments A and B and random assign- 
ment of 10 subjects to A and five to B, the entire sampling distribution of 
divisions consists of every possible division of the 15 obtained measurements 
that assigns 10 to Treatment A and five to Treatment B. If one were to 


difference between means for each division in the distribution, the 


compute a 

distribution of differences between means thus obtained would be the entire 
tion of differences between means. 
bution is a smaller distribution of pairings 
a distribution of statistics associated with 


le of all of the pairings or divisions 


sampling distribu 

An approximate sampling distri 
or divisions of measurements (or 
them) obtained by taking a random samp. 
in the entire sampling distribution. 


The term approximate randomiz. 
that uses an approximate sampling distribution instead of the entire sampling 


distribution in determining the significance of an obtained statistic. 
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C. PURPOSE oF APPROXIMATE RANDOMIZATION TESTS 


The advantages of randomization tests over parametric and rank order tests 
have been widely recognized. The only serious objection that has been raised 
against the use of randomization tests is that the amount of computation re- 
quired is excessive. The purpose of approximate randomization tests is to 
reduce the amount of computation for a randomization test to a practical 
level, thereby overcoming this objection. The following example will illustrate 
the reduction in computation that can result from using an approximate 
randomization test. 


Suppose that in an experiment 13 subjects are randomly assigned to 13 
different magnitudes of an independent variable and one measurement is 
obtained from each subject. The null hypothesis is that variation in magni- 
tude of the independent variable has no effect on the dependent variable; i.e., 
at the time he was measured every subject gave the same response that he 
would have given at that time to any of the other 12 magnitudes of the inde- 
pendent variable. This hypothesis is tested by computing a product-moment 
correlation coefficient or some other correlation statistic. Under the null hy- 
pothesis and random assignment there are 13! (factorial 13) — 6,227,020,800 
equally probable pairings of the 13 magnitudes of the independent variable 
with the 13 response magnitudes. "Thus, to determine with a randomization 
test what proportion of the sampling distribution of pairings gives as large 
a correlation statistic as the obtained value would require the computation 
of over six billion correlation statistics. Even an electronic computer that 
paired the measurements and computed a correlation statistic every second 
would take 197 years of continuous, 24-hours-a-day operation to compute all 
of the correlation statistics in the entire sampling distribution. 

Now consider the application of an approximate randomization test to the 
same situation. An electronic computer is programmed to use a table of ran- 
dom numbers to simulate taking a random sample, with replacement, of 999 
pairings from the entire sampling distribution and then to compute a correla- 
tion statistic for each of the selected pairings. This provides an approximate 
sampling distribution of 999 correlation statistics within minutes, in contrast 
to the 197 years required for the entire sampling distribution. 


D. DETERMINATION OF SIGNIFICANCE FOR 
APPROXIMATE RANDOMIZATION TESTS 


Now consider the procedure for determining the significance of results 
when approximate randomization tests are performed. The approximate 
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sampling distribution is ло? used in exactly the same way as the entire sam- 
pling distribution in determining significance. Instead of determining the pro- 
in the approximate sampling distribution that are as 
large as the obtained statistic, one determines the proportion within the joint 
distribution consisting of the approximate sampling distribution plus the ob- 
tained statistic that are as large as the obtained statistic. For example, when 
the approximate sampling distribution consists of 999 randomly selected 
ation problem, one determines how many of the 
(the 999 in the approximate sampling distribution 
plus the obtained statistic) are as large as the obtained statistic. (Since the 
obtained statistic is as large as itself, there can be no less than one of the 1000 
statistics as large as the obtained statistic.) This number is divided by 1000 to 
get the probability under the null hypothesis of getting such a large correla- 
tion statistic as the one obtained. One can then use this probability to decide 
whether to reject the null hypothesis. For instance, with a significance level 
of .05 the null hypothesis is rejected when the computed probability is no 
greater than .05; and with a significance level of .01, the null hypothesis is 
rejected when the computed probability is no greater than .01. That is, the 
null hypothesis is rejected at the .05 level whenever the obtained statistic 
falls within the upper five percent of the 1000 statistics (including the ob- 
tained statistic itself) and at the .01 level whenever the obtained statistic falls 
within the upper one percent of the 1000 statistics. 

It will now be shown that this procedure is valid: i.e., that when the null 
hypothesis is true the probability of rejecting the null hypothesis is equal to 
the level of significance. For example, when the null hypothesis is true, for 
this procedure to be valid the probability of rejecting at the .01 level must 


be .01. | 
pm assignment in conjunction with the null hypothesis деше опе 
to regard the obtained statistic as a random sample of one statistic taken from 
the entire sampling distribution of statistics. When the obtained € is 
added to the 999 randomly selected statistics there are, under the a y- 
pothesis, 1000 statistics randomly selected from the entire mue istri- 
bution. The obtained statistic can be regarded as the first statistic to be drawn. 
In random sampling there is, of course, no systematic tendency for the first 


drawn value to be either larger or smaller than values drawn later. So, for 
1000 randomly selected statistics the probability that the obtained, statisto 
(the first drawn statistic) will be within the upper half of the statistics is Y 
that it will be within the upper quarter is М, and so on. Thus the probability 


is .05 that, when the null hypothesis is true, the obtained statistic will fall 


portion of the statistics 


statistics, as in the correl 
1000 correlation statistics 
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within the upper 5 percent of the 1000 statistics, and the probability is .01 
that it will fall within the upper one percent of the 1000 statistics. Now, the 
recommended procedure rejects at the .05 level when the obtained statistic 
is in the upper five percent of the statistics and at the .01 level when it is in 
the upper one percent; so the probability under the null hypothesis of re- 
jecting at the .05 level is .05 and at the .01 level is .01. Approximate random- 
ization tests are therefore valid, not approximately valid, procedures for 
testing null hypotheses. Probabilities computed from approximate sampling 
distributions are not simply approximations to probabilities based on entire 
sampling distributions, but are probabilities in their own right. 

The same argument could have been used to show the validity of the 
proposed procedure if sampling without replacement had been carried out, 
but the following discussion of the power of the approximate randomization 
test will be considerably simplified by restricting it to sampling distributions 
obtained by sampling with replacement. 


E. Power or Approximate RANDOMIZATION TESTS 


The next consideration is the relative power of the approximate random- 
ization test based on an approximate sampling distribution of 999 statistics. 
First the following statement will be proved: 

Statement 1: The probability is .99 that no more than 17 statistics from 


the upper one percent of the entire sampling distribution will be in the ap- 
proximate sampling distribution, 


Then it will be shown that Statement 1 implies Statement 2: 


Statement 2: The probability is .99 that, whenever use of the entire sam 
pling distribution would lead to judging an obtained statistic to be significant 
at the .01 level, use of the approximate sampling distribution would lead to 
the assignment of a probability no greater than .018, 

For the proof of Statement 1 the entire sampling distribution is to be 
regarded as consisting of two kinds of statistics: those that are in the upper 
one percent of the distribution and those that are not. When the entire 
sampling distribution is randomly sampled with replacement, for a single 
draw there is a probability, > = .01, of drawing a statistic from the upper 
one percent of the distribution and a probability q = .99 of not doing so: 
that is, of drawing a statistic from the lower 99 percent of the distribution. 
Replacement after each draw ensures that P and q stay constant over all 
draws. The binomial distribution of the number of statistics from the upper 
1 percent of the entire sampling distribution for N — 999, р — .01, and 
q — .99 is almost a normal distribution with p = Np = 9.99 and о = 


© 
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ММ = 3.14. Ninety-nine percent of the area under a normal curve lies 
below the point that is 2.33 standard deviations above the mean. Consequently 
the probability is .99 that the number of statistics from the upper one per- 
ibution contained in the approximate sampling 


cent of the entire sampling distri й 
distribution is no greater than 9.99 4- (2.33) (3.14) — 17.31. This verifies 
assumed to occupy the interval 


Statement 1, since the discrete frequency 17 is 
16.5 to 17.5 in a normal distribution. 
Statement 1 also has been verified by computations by an electronic com- 


puter on the exact binomial distribution (not the normal curve approxi- 


mation). 2. | 
To see that Statement 1 implies Statement 2, it is necessary to appreciate 


that Statement | is true regardless of the magnitude of the obtained statistic, 
inasmuch as the selection of the statistics in the approximate sampling dis- 
tribution is in no way affected by the magnitude of the obtained statistic. 
Therefore Statement 1 is true even when the obtained statistic is large 
enough to be judged significant at the .01 level by means of the entire sam- 


i istributi Phi derive Statement 2 from State- 
1 ‚ This fact enables one to 
pling distribution ;ould be judged significant at the .01 level by 


ment 1. An obtained statistic W bej ті і 
means of the entire sampling distribution only if it had a value that lies 
f the entire sampling distribution of statistics. 


ercent 0 VL эк мй r 
g distribution that might pos- 


within the upper one P ; 1 
he approximate samplin 


The only statistics in t 
sibly diee as the obtained statistic, therefore, would be those from the 


upper one percent of the entire sampling distribution. This еп in conjunc- 
tion with Statement 1 (which has already been wai, Байый This Suis 
clusion: When an obtained statistic would have been judged significant А! 
the .01 level by means of the entire E pesos pope on el 
3 A e no more than va approximate sí 
да dhat hate Wi b M а the шие ve P 909 а to he 
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from the upper one pétcett sampling distribution plus the obtained statistic 
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Analogous computations and arguments in regard to the binomial distri- 
bution of the number of statistics from the upper five percent of the entire 
sampling distribution for N — 999, 5 — .05, and 4 — .95 lead to this state- 
ment: 

Statement 3: The probability is .99 that whenever use of the entire sam- 
pling distribution would lead to judging an obtained statistic to be significant 
at the .05 level, use of the approximate sampling distribution would lead to 
the assignment of a probability no greater than .066. 

Like Statement 2, Statement 3 has also been verified by electronic computer 
computations on the exact binomial distribution. 

One can, of course, follow the same sort of procedure to determine proba- 
bility statements comparable to Statements 2 and 3 for other levels of sig- 
nificance or for approximate sampling distributions of other sizes. An approx- 
imate sampling distribution with more than 999 statistics has more power, 
but it can be seen that this number provides sufficient power for many 
purposes. 


Е. Summary 


Ап approximate randomization test is a randomization test in which the 
significance of an obtained statistic is determined by using an approximate 
sampling distribution consisting of a random sample of the statistics in the 
entire sampling distribution. 

he purpose of Approximate randomization tests is to reduce the amount 
of computation for à randomization test to a practical level, thereby over- 


coming the only serious objection that has been raised against the use of 
randomization tests, 


tistic (such as a Correlation coefficient or a difference between means) is 
determined by reference to a distribution composed of the approximate sam- 
pling distribution plus the obtained statistic. The significance is the proportion 
of statistics within this distribution that are as large as the obtained statistic. 
This proportion is the probability under the null hypothesis of getting such 


When the approximate sampling distribution consists of 999 statistics, the 
power of the approximate randomization test is almost equal to that of the 


P 


EUGENE S. EDGINGTON 149 


randomization test employing the entire sampling distribution. Specifically, 
the probability is .99 that whenever use of the entire sampling distribution 
would lead to judging an obtained statistic to be significant at the .01 level, 
use of the approximate sampling distribution would lead to the assignment 
of a probability no greater than .018. Also, the probability is .99 that when- 
ever use of the entire sampling distribution would lead to judging an ob- 
tained statistic to be significant at the .05 level, use of the approximate sam- 
pling distribution would lead to the assignment of a probability no greater 
than .066. These probability statements are true for approximate sampling 
distributions of 999 statistics selected at random, with replacement, from the 


entire sampling distribution regardless of the size of the entire sampling 


distribution. 
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WHO SEEKS TREATMENT? А STUDY OF PSYCHIATRICALLY 
DISABLED VETERANS* 


Mental Hygiene Clinic, Oakland VA Office’ 


HERBERT C. ARCHIBALD, Davip E. PARKER, M. Kent SMITH 
, 
AND Reap D. TUDDENHAM 


A. INTRODUCTION 


Every public facility for treatment is concerned with the degree to which 
it serves its potential population of eligible patients. This is particularly true 
of the Veterans Administration because the nation has long since com- 
mitted itself to providing optimal care to all former members of the armed 
forces who meet eligibility requirements. The present study compares two 
groups of psychiatrically disabled veterans: those who receive psychiatric 
treatment in an outpatient Veterans Administration Mental Hygiene Clinic; 
and those who do not receive treatment, although equally eligible. It is part 
of a continuing research investigation both of the treatment process and of 
the characteristics of the outpatient psychiatric veteran population. However, 
the present concern is not with treatment, but rather with those factors 
associated with the veteran's decision to enter psychotherapy. 

1. Previous Studies 


and Beebe (3) reviewed medical follow-up studies with 
ich were then either completed or in progress. At 
that time, the bibliography included some 99 reports covering the entire 
field of medicine. Only a fraction of the studies were in the general field of 
psychiatry. One such study, that by Sullivan (6) in 1955, is of interest be- 
Cause it, like the present investigation, was conducted at the Oakland, Cali- 
fornia, Veterans Administration Clinic. Sullivan compared treatment and 
nontreatment groups with respect to the MMPI. His nontreatment group 
Consisted of veterans with a primarily psychiatric diagnosis who came to the 
Outpatient clinic for a periodic evaluation of their compensation status, but 
who had not been treated there. The treatment group consisted of active pa- 
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tients at the Veterans Administration Clinic. Differences between the groups 
were not great, but those who came for treatment were the more disturbed. 
It proved possible to create an MMPI scale which differentiated patients 
from nonpatients with a 72 percent hit rate on cross-validation. Items chosen 
more often by patients related to excessive morality, excessive conscientious- 
ness, guilt, and self-defeating neurotic trends. 

In 1958, Sullivan, Miller, and Smelser (7) did a related study at the 
same clinic on factors in “length of stay" and “progress in psychotherapy." 
Only age, education rating, and occupational rating were reliable predictors 
of length of stay. They reported that on other variables the results were 
extremely inconsistent from sample to sample. Improvement, as rated by staff 
members, was associated with higher occupational levels and with less serious 
psychopathology. 

Diener and Young (4) in 1961 studied psychiatrically diagnosed veterans 
in Colorado, and failed to find significant relationships between coming or not 
coming for treatment on the one hand, and on the other hand, degree of dis- 
ability, number of compensation cuts, or husband vs. son role played by 
veterans in their families. However, extending the sample to the entire popu- 
lation of such cases in Colorado, in 1964, Dworin, Green, and Young (5) 
found that both degree of disability and nearness to the mental hygiene clinic 


had significant correlations (.33 and .37 respectively) with coming to the 
clinic for treatment. 


2. Present Study 
The concern of the Oakland Veterans Administration Clinic to check its 
own effectiveness, especially in the light of inconsistencies in previous inves- 
tigations, suggested the desirability of further research. Specifically, the clinic 
staff wished to extend Sullivan's MMPI study to explore differences between 
treated and nontreated veterans in respect to overt symptoms, Further, the 
clinic wished to evaluate the implications of the Dworin ef al. (5) and Sul- 


livan et al. (7) papers that demographic and geographical factors may be im- 
portant in seeking treatment. 


B. METHOD 


1. Sample 


The "treatment" group consisted of 67 psychiatrically disabled veterans 
in active psychotherapy. The length of treatment was from one to over 100 
interviews. All but 10 cases had had more than nine interviews, the median 
length of stay in this clinic (7). 
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The “nontreatment” group consisted of 68 veterans who came to the clinic 
only for a review of their neuropsychiatric compensation claims. Successive 
neuropsychiatric re-evaluation cases were referred to the mental hygiene clinic 
until the group approximated the treatment group in size. These veterans had 
never sought outpatient psychiatric treatment in the Veterans Administration. 


2. Procedure 


Patients in both treatment and nontreatment groups were asked to complete 
a questionnaire? containing a checklist of somatic symptoms of stress, and 
items concerning subjective feelings of discomfort, personal worries, phobic 
fears, and anxiety. This questionnaire also included sociological items per- 
taining to job stability, marital status, and history, etc. 

The nontreatment cases were interviewed by a staff member, after they had 
completed the questionnaire, to check further upon whether or not they had 
had any psychotherapy, and if so, its nature and amount. On the basis of this 
interview, veterans were rated on 12 scales, each consisting of five carefully 
defined steps. Variables included were speech quality, motor behavior charac- 
teristics, anxiety level, depression, hypomania, anger, hostility, abnormal men- 
tal trends, attitude toward treatment, frankness, emotional distance, and 


dependence. 
C. RESULTS 


Fifty-seven items of the questionnaire could be described as "complaints" 
to be checked or not checked. The questionnaire included 73 items for combat 
veterans (10 percent of the clinic population). Eight questions were deleted 
for those who were service-connected but saw no combat. Twenty-two symp- 
ach group for "in service,” “at discharge," and for the 


toms were checked in e l 
c facts and a complete neuropsychi- 


present (time of interview). Demographi 


atric examination were available in every case, 
Notwithstanding the fact that the checklists were filled on the day of 


compensation examination by the untreated veterans but after an average of 
seven hours of psychotherapy for the treated veterans, no change was noted in 
the Jatter’s basic attitude toward symptoms or the preponderance of psycho- 


logical symptoms. e. 
Included were such symptoms as severe headache, or irritability, fears— 
such as fear of jet plane noises—and “worries’—for example, about letting 


i ire i i i i t al. (1, 2). Many 
2 i tionnaire is described at greater length in Archibald e ‚2 
of Mh pues differentiated combat stress patients both from healthy combat 
veterans and from psychiatrically disturbed noncombat veterans. 
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others down, being injured, or becoming insane. Of these 57 symptoms, 44 
were checked more frequently by the patients than by the nonpatients ; only 13 
more frequently by nonpatients than by patients. This ratio is significant 
beyond the .001 level. Clearly, members of the treatment group cither ex- 
perienced more symptoms than members of the nontreatment group or were 
more willing to acknowledge having them. Within the nontreatment group, 
interview ratings correlated meaningfully with questionnaire items, suggesting 
that those who checked the most symptoms were seen as being, for example, 
more "anxious" and "depressed" by an observer? 

Clinical experience had led the authors to suspect that patients in treatment 
are relatively more likely to acknowledge phobic fears and worries; whereas 
the nontreatment cases are relatively more likely to report symptoms having 
a strong somatic component. To test this hypothesis, the two groups were 
compared with respect to choice frequency (i.e., frequency of item checking) 
summed across the 10 most clearly somatic items versus choice frequency 
summed across the 27 more clearly "psychological items," consisting largely 
of worries and fears. The y? was significant (one tail) at the five percent 
level. While less certain than the finding as to overall number of symptoms 
in the two groups, there is a strong indication that patients view their diffi- 
culties as more psychological in nature than do nonpatients; whereas the non- 
patients emphasize their somatic symptoms. 

The specific symptom items differentiating the two groups at various sig- 
nificance levels are listed in Table 1. The psychological aspects of the concerns 
of patients in treatment are apparent in the table. While the nontreatment 
group checked somatic symptoms more often than did the treatment group, 
only one—"sweaty hands or feet"— reached statistical significance. A few 
other items approached the 10 percent level. More nontreatment than treat- 
ment cases denied sexual difficulties, and more nontreatment were still active 
in a veterans' organization. Asked to compare their general health at present 
to their health status while in service, nonpatients had a tendency to use the 
more extreme categories, "better" and "much worse," while patients more 
often saw themselves as unchanged. Replication of the study would be re- 
quired to establish the meaningfulness of these last differences. 

Sociological variables, such as marital status and job stability, were not 
differentiating; but social status as measured by educational level and occu- 
pational level was a factor with regard to staying in therapy (Table 2). 


3 Since the interviewer had access to the questionnaires to check for omitted items 
or skipped pages, there is a possibility that his ratings may have been contaminated 
by the subjects’ responses to the questionnaire, although the ratings were intended to 


be independent. 
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TABLE 1 
DICHOTOMOUS QUESTIONNAIRE ITEMS WHICH SIGNIFICANTLY DIFFERENTIATE 
TREATMENT vs. NONTREATMENT GROUPS 


Significant at Significant at 
10% level 


Significant at 
196 level 5% level 


A. Items checked тоге often by the treatment group 


Irritability Depression Sighing and yawning 
(Hyperventilation) 
Alcohol to excess Worry about insanity Other things (than 


therapy) have 

helped your 

condition 
Worry about fear of 


Worry about specific 
hurting others 


acts you were 
ashamed of 
Matters overlooked by 
clinic which would 
help in understanding 
your condition 


B. Item 


Additional symptoms 
(added by veteran) 


s checked more often by the nontreatment group 
(none) 


Sweaty hands or feet (none) 
TABLE 2 
COMPARISON OF EDUCATIONAL AND OCCUPATIONAL LEVELS 
TY тж Stayers Nonstayers Nontreated 
Variable Mean SD Mean SD Mean SD 

Educ. rating 2.84 1.19 342 1.26 3.61 147 
Оссир. rating 427 1.87 4.90 1.82 4.91 2.10 

R c scales for Educational and Occupa- 


Note: Ratings on the Warner socioeconomi 
tional level indicate that there is a trend fo: 


level to be associated with not seeking treatment « ; 
ment. The differences between the Stay and Nonstay group on Educational and 


Occupational are significant at the .001 level and the .01 level respectively. While the 
nontreated educational and occupational levels are not significantly different from 


the nonstayers, their scores show a trend in the expected direction. 


The accessibility of the clinic to the 
to be a factor. In checking the trav 
subjects would have to come, it was 
100 miles or more from the clinic. 


r lower educational and occupational 
nt or seeking but not staying in treat- 


veteran's place of residence also seemed 
elling distance that these nontreatment 
discovered that many lived as far as 
They could travel this distance for 


their home town from 
correlated with th 


nonsignificant (.0 е 
outside Oakland. 'Thus, distance o| 


If they decided to become 
how long they remained in treatment. 
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periodic compensation examinations, but could hardly have been expected to 
have become patients at this clinic. Yet, many come from large urban centers 
where state mental health services are available, and private psychiatric 
services can be obtained locally in many communities and paid for by the 
Veterans Administration. Many of these patients, then, could have obtained 
some form of treatment had they desired it. This consideration led us to 
suspect that many patients in the treatment group would not have obtained 
treatment if they had lived outside a reasonable commuting radius from the 
clinic. To check on the influence of distance per se on use of the clinic, the 
home town of each subject, treatment or nontreatment, was obtained and 
scored according to the number of 10-mile intervals between it and the clinic. 
The distances were measured from an ordinary road map of the Bay Area; 
they were taken "as the crow flies" and, hence, only approximated travelling 
time. Nonetheless, differences between the two groups were more marked 
than had been anticipated. Every member of the treatment group livéd within 
a 30-mile radius of the clinic, while only 22 of 68 in the nontreatment cate- 
gory lived within this radius. The two distance distributions were compared 
by means of the Mann-Whitney U-test. "The result was significant far beyond 
the one percent point (— S4. 


D. CowcLusioxs 


In this study of utilization of an outpatient Veterans Administration mental 
hygiene clinic facility, the most conspicuous difference between veterans who 
elected and those who did not elect psychiatric treatment is distance of resi- 
dence from the clinic. However, the fact that nontreatment cases living 
at a distance from the clinic did not elect treatment at easily accessible state 
or county facilities or from psychiatrists in their own communities paid by 
the Veterans Administration suggests that the causal factors in accepting or 
avoiding treatment are more complicated than they appear. Those who sought 
and stayed in treatment tended to be of higher educational and occupational 
level. Study of the reasons for nonutilization or underutilization of the avail- 
able non-Veterans Administration facilities by veterans appears to be indicated. 

Patients and nonpatients appear to differ also in the way they formulate 
their difficulties. Patients reported more symptoms and their complaints 
were more psychological in nature. Nonpatients emphasized somatic symp- 
toms, and indeed often denied psychological concerns. Comments, such as the 
following, were typical in this group: 

“I don't understand why I was sent up here. Does that mean someone 
thought I was emotionally unstable?" 
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« , 
"idu t understand why they call me a nervous case. I had a back injury." 
I've never been emotional in my life. I don't worry about anything. I 


have a wonderful wife." 
However, it is uncertain whether the patients elected psychiatric treatment 


because they were more sensitive to the psychological component of their 
difficulties or whether, prompted by unknown factors, they came in for treat- 
ment and, in the course of it, learned to see their problems in a psychological 


light. 
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A COMPARISON OF “REAL” VS. “IDEAL” SELF WITH A 
SELF-ACTUALIZATION INVENTORY* » 2 


Department of Psychology, University of Bridgeport 


Joun R. BRAUN AND PATRICIA ÅSTA 


A. INTRODUCTION 


The Personal Orientation Inventory (POI) contains 150 paired-opposites 
12 scales attempting to measure self-actualization (10). In- 
as nonself-actualized have obtained significantly lower 
(less favorable) scores on 11 of these scales than those nominated as self-ac- 
tualized (9, 10). There have been similar findings with “highs” vs. “lows” 
on the Neuroticism scale of the Eysenck Personality Inventory (6), alcohol- 
ics vs. normals (12), and beginning therapy patients vs. patients in therapy 
for two years (11). Ilardi and May (5) found that POI reliability compares 
favorably with that of the MMPI, and Grossack (4) has studied Edwards 
Personal Preference Schedule and 16 PF correlates of self-actualization 
scores, Damm (3) reported a significant positive correlation between per- 


formance on the POI and the Remote Associates Test (a measure of cre- 
ativity), and Braun and Asta (1) have found significant positive correlations 


between the Original Thinking Scale of the Gordon Personal Inventory and 


five of the POI scales. 

Braun and LaFaro (2) had four groups of undergraduates complete the 
POI under standard instructions, and later, under instructions to make either 
a good impression or appear well-adjusted. Results showed a less favorable 
score when faking in 45 of the 48 comparisons between standard and faked- 
administration means, with 23 of the 48 2 tests significant at the .05 level or 
better. АП 23 of the significant findings involved lower faked versus standard 
administration scores. It was concluded that “Apparently the values deemed 
to be those of self-actualizing persons ате not those judged likely to create a 
o be indicators of good adjustment by the typical college 


items scored for 
dividuals nominated 


good impression or t 


student (or convicted felon!).” 
es 

» Received in the Editoria 
and published immediately at 3 
The Journal Press. 
1 Presented at the annual conven 


i .C., April, 1968. . R 
Wadi ora are Fadebted to Mrs. Frances Brown for her kind cooperation in ob- 


taining the data on 


Massachusetts, on April 1, 1969, 


] Office, Provincetown, 
Massachusetts. Copyright by 


5 New Street, Worcester, 


tion of the Eastern Psychological Association, 


159 


160 JOURNAL OF PSYCHOLOGY 


Rapaport (7) studied the effects of real-self vs. ideal-self instructions on 
the MMPI performance of 48 military psychiatric patients. Ideal-self means 
were significantly more favorable than real-self means for the majority of the 
scales. The present study compared POI performance under real-self and 
ideal-self instructions. Persons rather consistently score lower on self-actuali- 
zation when trying to make a good impression than they do when trying to 
be honest. How would they like to be in terms of self-actualization? 


B. METHOD 


Three groups of undergraduates were used as subjects. Group I consisted 
of students enrolled in evening sections of introductory psychology (real-self 
М = 60; ideal-self N = 67). Group II consisted of student nurses enrolled 
in introductory psychology (real-self N = 51; ideal-self N — 52). Group 
III consisted of volunteers recruited from introductory psychology (N for 
both real and ideal self = 63). There were approximately equal numbers of 
men and women in Groups I and III, while Group II had all women. 
Groups I and II were tested during regular class meetings, while Group III 
was tested outside of class. 

All groups completed the POI on two occasions. The Ns for the two 
administrations differ slightly for some groups due to absences from class. 
Groups I and II had ideal-self administration first, and approximately one 
month later, real-self administration. Group III had real-self administration 
first, and one week later, ideal-self. 

Groups I and II did not put any identifying data on their answer sheets, 
while Group III used only self-selected pseudonyms to enable matching of 
first and second administration protocols for statistical purposes, Significance 
of differences between means for the two administrations was evaluated by ¢ 
tests for Groups I and II, and by Sandler's A-statistic (8) for Group III. 

Real-self instructions emphasized being honest and forthright "answer as 
you really are." Ideal-self instructions indicated “. -. We want you to answer 
so as to tell us about your ideal self. That is, what sort of person would you 
ideally like to be? Answer not in terms of your real self, the kind of person 


you are now; but rather, answer in line with the kind of person you would 
like to be: your ideal self.” 


C. RESULTS AND Discussion 


Table | presents means, standard deviations, and results of significance 
tests for the two administrations. The direction of difference between real- 


=a 
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self and ideal-self means was consistent across all three groups for 10 of the 
12 POI scales, with 21 of the 36 ¢ or A-tests significant at the .05 level or 
better. Time Competence, Self-Actualizing Value, Spontancity, Self-Regard, 
Nature of Man, Constructive, and Synergy were increased in all three 
groups; while Existentiality, Feeling Reactivity, Self-Acceptance, and Accep- 
tance of Aggression were decreased. For each of the above scales, the differ- 
ence between real- and ideal-self means was significant in at least one, and 
more often, two or three, of the groups. The Inner Directed and Capacity 
for Intimate Contact scales were not affected consistently by the different in- 
structions across all three groups, and for these scales, none of the compari- 
sons between real- and ideal-self means reached significance. 

All 12 POI scales are assumed to measure an aspect of self-actualization. 
Under ideal-self instructions, our subjects moved in the direction of greater 
actualization on six scales, while moving toward less actualization on four 
other scales. From scale descriptions in the POI manual (10) we would say 
that under ideal- vs. real-self instructions, they prefer to live in the present 
rather than past or future; hold, rather than reject, the values of self-actual- 
izing people; freely express feelings behaviorally rather than being fearful of 
doing so; have high, rather than low, self-worth; see man as essentially good 
rather than evil; and see the opposites of life as meaningfully related rather 
than antagonistic. On the other hand, we would also say that they prefer to 


be rigid, rather than flexible, in application of values; insensitive, rather than 
sensitive, to own needs and feelings; unaccepting, rather than accepting, of 
the weaknesses of self; and denying, rather than accepting, of feelings of 
anger or aggression. 

With six scales increase 
there is no discernible tendency 


d and four decreased under ideal-self instructions, 
for subjects to wish to be either higher or 
lower in self-actualization than they think they already are. This conclusion 
is supported by the lack of any significant change on the Inner Directed scale, 
which contains 127 of the 150 items, and which is regarded by Knapp (6) as 
the single most representative overall measure of self-actualization, 

It would be of interest to determine if the same trends found in the present 


study would be present with subjects of different age, education, socioeco- 
nomic status, etc. In addition, the contribution of social desirability to the 
various POI scales should be investigated. Are the items on the scales which 
were increased under ideal-self instructions significantly higher in mean social 
desirability rating than those which appear on the scales that were decreased 


under this instruction? 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS FOR "REAL" versus "IDEAL" SELF 
Group 
Parameter Instruction 1 п Ill 
Time Competence 
Mean Real Self 15.47 15.13 15.40 
SD Real Self 3.31 3.91 3.34 
Mean Ideal Self 17.15** 16.79* 16.70** 
SD Ideal Self 2.96 2.84 2.98 
Inner. Directed 
Mean Real Self 80.93 78.75 78.08 
SD Real Self 11.52 11.69 10.35 
Mean Ideal Self 81.85 75.48 79.67 
SD Ideal Self 9.33 12.68 9.27 
Self-Actualizing Value 
Mean Real Self 19.18 1849 17.94 
SD Real Self 2.78 3.43 3.18 
Mean Ideal Self 20.12 19.65 20.11** 
SD Ideal Self 2.52 2.72 3.45 
Existentiality 
Mean Real Self 19.53 18.27 19.08 
SD Real Self 4.06 4.55 3.50 
Mean Ideal Self 17.84* 15.79** 157954 
SD Ideal Self 3.88 3.64 445 
Feeling Reactivity 
Mean Real Self 16.00 14.63 15.43 
5р Real Self 2.68 3.00 2.73 
Mean Ideal Self 14.82* 14.21 14.06** 
SD Ideal Self 2.68 2.80 2.95 
Spontaneity 
Mean Real Self 11.45 10.60 11.00 
SD Real Self 2.35 2.82 248 
Mean Ideal Self 12.04 ne 
ш 11.39 12.33 
SD Ideal Self 2.52 2.11 2.21 
Self-Regard 
Mean Real Self 10.62 1 
8р Real Self 2.80 E 1945 
Меап Ideal Self 12.93** 12.94** 13.87** 
SD Ideal Self 1.89 152 1.71 
Self-A cceptance 
Mean Real Self 15.05 15.37 1444 
SD Real Self 4.00 3.28 3.39 
Mean Ideal Self 14.67 13.92* 13.60 


SD Ideal Self 3.36 3.01 2.64 
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TABLE 1 (continued) 


Group 
Parameter Instruction I II III 
Nature of Man: Constructive 
Mean Real Self 11.27 11.45 
SD Real Self 2.13 219 '196 
Mean Ideal Self 12.15* 11.85 ў 2** 
SD Ideal Self 2:22 1.94 1076 
Synergy 
Mean Real Self 6.28 6.10 6.25 
SD Real Self 147 1.68 1:39 
Mean Ideal Self 6.84* 6.83* 6.75** 
SD Ideal Self 1.34 1.33 1.34 
Acceptance of Aggression 
Mean Real Self 16.33 14.78 15.41 
SD Real Self 2.93 3.54 346 
Mean Ideal Self 15.06* 14.19 14.41* 
SD Ideal Self 3.04 2.56 2.62 
Capacity for Intimate Contact 

Mean Real Self 17.22 16.61 16.82 
SD Real Self 3.36 3.25 3.19 
Mean Ideal Self 17.25 15.42 1641 
SD Ideal Self 3.63 3.32 3.81 


» Significantly different from "Real Self" mean at .05 or .02 level. 
** Significantly different from “Real Self" mean at .01 or .001 level. 


D. SUMMARY 
aduates completed a measure of self-actualization, 
under both standard and ideal-self in- 
hich compared scores under standard 
Its were inconsistent across the 12 
appeared more self-actualized 


"Three groups of undergr 
the Personal Orientation Inventory, 
structions. Contrary to earlier studies, w 
vs. "good-impression" instructions, resu 
scales. "Thus, on six scales subjects consistently 
under ideal-self instructions than standard instructions, while on four others 
the reverse was true. Inner-Directedness (regarded by Knapp as the single 
most representative overall measure of self-actualization) and Capacity for 
Intimate Contact were not affected significantly by the different instructions 


in any of the groups. 
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THE CLINICAL UTILITY OF BLOCK'S EGO-RESILIENCY 
AND EGO-CONTROL SCALES* 


Department о} Psychology, Saint Louis University 


Dennis Kxzrr, Rosenr T. BopENSTEINER, THomas W. VoDDE, 
AND Matcoitm D. GYNTHER 


A. INTRODUCTION 
extensive research on the two major factors of 


the MMPI, which has taken into account both the need to control for the 
confounding influence of stylistic factors and the need to obtain behavioral 
correlates as validating evidence. Based on data from nonpsychiatric samples, 
he identified the first factor as “ego-resiliency” and the second as "ego-con- 
trol." He also developed two scales to measure these dimensions in which the 
effects of acquiescence and social desirability (SD) are controlled. 

Further information regarding the clinical utility of Block's ego-resiliency 
(ER-S) and ego-control (EC-5) scale scores seems desirable, especially since 
Block has indicated that his scales should operate with greater discriminating 
power “їп samples containing individuals with a wider range of psychopathol- 
ogy—samples more appropriate to the initial intent and aspiration of the 
MMPI” (1, p. 109). The purpose of the present study was to investigate 
the relationships among ER-S and EC-5 scores of psychiatric inpatients and 
diagnostic classifications, 1Q, education, marital status, and 


other clinically relevant vari 


Block (1) has conducted 


age, sex, race, 
ables. 


B. METHOD 


1. Pilot Study 
atients were obtained by selecting every 
Iphabetized list of 1,281 Malcolm Bliss Mental 
i AMPI’s between 1957 and 
i Љо had been given А 
voe = ! sage un due to the MMPI short forms, this procedure 
resulted in a sample of 130 patients: 21 Negro males, 53 white males, 11 
N females, and 45 white females. The range of ER-S scores was nine to 
Ml wd eo of 18.56 and standard deviation of 5.59. The range of the 
31 with a mea 56 а 


Norms for hospitalized psychiatric p 


eighth patient from an à 


Massachusetts, on April 2, 1969, 


rincetown , 
T5 y Massachusetts. Copyright by 


Worcester, 


z + 1 c 
* Received in the Editorial e Street, 


and published immediately at 
The Journal Press. 
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EC-5 scores was seven to 29 with a mean of 16.25 and standard deviation of 
9.32. The respective distributions were both slightly positively skewed, but 
very close to normal. 

A random sample of 40 Ss was selected from the 130 Ss to examine possible 
relationships between ER-S and EC-5 scale scores and number of previous 
psychiatric hospitalizations, and social maladjustment status (i.e., prisoner or 
court referral), in addition to the variables referred to earlier. No significant 
relationships were found, but several trends were noted: (a) behavior disor- 
ders tended to obtain relatively high ER-S scores and relatively low EC-5 
scores; (0) neurotics and organics tended to obtain the opposite pattern of 
scores; and (c) no relationship between either score and psychotic diagnosis 
was observed. 


2. Main Study 


a. Subjects. 8s were drawn from the entire pool of 1,281 patients by in- 
cluding every tenth name listed until 100 were obtained. Ss used in the pilot 
study proper (N = 40) were excluded. The age range for this sample was 
17 to 61 with a mean of 32.77 years. The /Qs ranged from 63 to 128 with a 
mean of 96.27, and the range of education was five to 20 years with a mean 
of 10.08. 

b. Procedure. Each $'s MMPI was scored for the ER-S and EC-5 scales. 
The ER-S scale consists of 40 items, 20 keyed true, 20 keyed false, with an 
average SD rating of 5.16 on a nine-point scale. The EC-5 scale consists of 
32 items, 16 keyed true, 16 keyed false, with an average SD rating of 5.33. 
For further details, see Block (1). Ss were grouped into high vs. low scorers 
on each scale with the use of the norms developed in the earlier procedure. 
For ER-S, highs were defined as 18 and above, lows as 17 and below; for 
EC-5, highs were defined as 17 and above, lows as 16 and below. High and 
low ER-S and EC-5 scorers were then compared on the demographic, diag- 
nostic, and clinical variables. 


C. RrsuLTS AND Discussion 


Aside from diagnosis, which will be discussed in the following paragraph, 
the only patient variables that were significantly related to the categories de- 
rived from scores on Block's scales were age and sex differences for the ego- 
control dimension. Older Ss obtained significantly higher scores than younger 
Ss ( = 2.01, p < .05), which is consistent with Block's interpretation of 
low scores on the ego-control dimension as indicating impulsivity, a charae- 
teristic likely to decrease with age. The finding that females were significantly 


= 
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more controlled than males (x? = 15.77, Ф < .001) supports Block's similar 
finding which he has interpreted as reflecting culturally determined sex-role 
patterns concerning management of impulses. Block suggests that separate 
scales be developed for males and females. 

'The diagnostic classifications of high vs. low scorers differed significantly 
at less than the .05 level on both scales. With regard to the ego-resiliency 
dimension, the significant differences resulted from behavior disorders having 
es than neurotics and organics. For the ego-control 
and, behavior disorders had significantly lower 
tics and organics. Scale scores were un- 
finding that is consistent with the data 


significantly higher scor 
dimension, on the other h 
scores than those found among neuro 
related to the psychotic diagnosis, a 


obtained in the pilot study. 


findings suggested that it might be instructive to run a secondary 


a quadrant approach. To sharpen the differences, the 
at or near the midpoint of either scale were excluded. 
classified as low, with ER-S scores of 20 and 
£ 15 or less were classified as low, with 


EC-5 scores of 18 and above classified as high. A comparison between these 
four groups on admission status approached significance Oe ES 753; 05 < 
p < 40), primarily due to the fact that more first admission patients fell in 
the high ER-S—low EC-5 cell than would be expected by chance. 

Because of an insufficient sample size, all organic and neurotic Ss were 
eliminated from the analysis of diagnosis by quadrants. Results of this anal- 
gnificance (х? = 746;.05 < $ < 10) reflecting the fact 
behavior disorders and fewer psychotics than expected 
C-5 scores, while there were more psychotics and 
fewer behavior disorders with high ER-S, high EC-5 scores. Although there 
were only six neurotic patients in the total study, four of these patients were 
found in the low ER-S, high EC-5 category. , mu 

'These results are obviously not Very impressive. One could not predict di- 
agnosis with any useful degree of accuracy by means of the ER-S score, the 
EC-5 score, or “a combination of the two. Yet some significant relationships 
have been demonstrated; which is an improvement over the completely nega- 


tive findings of Gynther and Brilliant (2) with the use of Welsh's (4) anxi- 
E One might conclude that Block's unconfounded scales 


diagnostic utility than confounded measures of the same 


These 
analysis with the use of 
27 Ss whose scores were 
ER-S scores of 17 or less were 
above classified as high. EC-5 scores о 


ysis approached si 
that there were more 


with high ER-S, low E 


ety-repression scales. 

have more potential 

MMPI factors. 
One suggestion for 


ER-O scale (108 items 


g these lines is to use Block's 


further research alon 
first factor, balanced for true- 


also measuring the 
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false, but selected without regard for SD scale values) in place of the ER-S 
scale. Kneff (3) found that psychiatric patients obtained significantly lower 
scores than Block's normal Ss on the former scale, whereas the scores ob- 
tained in this study with the latter scale do not differ from scores Block ob- 
tained from normals. If ego-resiliency means "integrated and yet open to ex- 
perience, able to cope resourcefully" (1, p. 110), one should use the form of 
the scale which most effectively separates normals and patients on this dimen- 
sion. 


D. SUMMARY 


The purpose of this study was to investigate the relationships among 
Block's ego-resiliency and ego-control scale scores and diagnostic classification; 
age, sex, race, JQ, education, marital status, prisoner-nonprisoner status, and 
admission status. Norms for psychiatric inpatients were developed in a pilot 
study. The main sample of 100 patients was divided into high and low scorers 
on each scale. The diagnostic classifications of high and low scorers differed 
significantly on both scales, with neurotics and organics scoring higher than 
behavior disorders on the ego-resiliency dimension and the opposite pattern 
resulting for the ego-control dimension. Scale scores were, however, unrelated 
to the psychotic diagnosis. The only significant findings related to the other 
variables were that older patients scored higher than younger patients, and 
that females scored higher than males on the ego-control scale. This approach 
appears to be promising, but changes in procedure are necessary to increase 
the predictive power of these scales, 
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FACTORIAL STUDY AND ANALYSIS OF FACTOR VARIANCE 
OF GENERAL ADAPTABILITY SELF-RATINGS FROM A 
COMBINED SAMPLE OF MALE AND 
FEMALE SUBJECTS** 


Stevens Institute of Technology 


Rio SCIORTINO 


A. INTRODUCTION 


tino (7) factorized the scores of a combined sam- 


In a recent study Scior 
a Motivational Adjective List 


ple of male and female undergraduate Ss on 
(MAL) and two factors were extracted: striving and assertion. The MAL 


was constructed by Sciortino (6), together with two other separate adjective 
lists, the General Adaptability Adjective List (GAAL) and the Intellective 
Adjective List (IAL), to cover three aspects of the intellectually superior 
and creative individual: namely, motivation, general adaptability, and intel- 
lection, When an analysis of factor variance (8) was performed on the two 
factors of the MAL the results showed that the female Ss rated themselves 
significantly higher than the male Ss on both factors: striving and assertion. 

The purpose of this study was (a) first, to find the factorial structure of 
the General Adaptability Adjective List (GAAL) for a combined sample of 
male and female Ss; and (4) second, to determine, by analysis of factor vari- 
ance, the effect of sex, male vs. female Ss, on the factor score means of the 
factors of the GAAL obtained in section (а). For the factorial structure of 
the GAAL several factorial methods were used and the one that, most ade- 
quately, met the criteria of simple structure was chosen for the interpretation 
of factors. No correlations between the scores on the GAAL and scores on 
other instruments were undertaken in this study. In the analysis of factor 
Variance of the GAAL factors; P ratios were computed for the factor score 


means obtained from several factorial methods. 
B. Factor ANALYSES 
1. Method 


a. Sample. Male (N = 100) and female (N = 102) undergraduate stu- 
Editorial Office, Provincetown, Massachusetts, on April 7, 1969, 
and published immediately at 35 New Street, Worcester, Massachusetts. Copyright by 


Th Я ; | 
eme Pre ily were collected in the spring quarter of 1962 at the Ohio State 
University. 
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dents required to serve as Ss to fulfill the experimental requirements for a 
course in general psychology were used. 

b. Description of the test: The General Adaptability Adjective List 
(GAAL). 'The GAAL, constructed by Sciortino (6), consists of the 39 
items listed in Table 1. Ss rated themselves on each of the 39 items using the 


TABLE 1 
Matrix ОЕ MEANS, STANDARD DEVIATIONS, PRoMAX-EQUAMAX FACTORS FOR SCORES ON 
THE GENERAL ADAPTABILITY ADJECTIVE Lisr (GAAL) BY 
Mace (№ = 100) лмо FEMALE (№ = 102) Ss 


Promax-Equamax factors 


Variables M SD A B с р Е 

1. Spontaneous 3.26 25 18 —25 07 19 
2. Resilient 347 39 —09 10 23 —13 
3. Expansive 3.20 38 14 00 —18 07 
4. Cordial 3.54 35 —26 29 24 —01 
5. Amorous 3.71 49 05 —04 —09 03 
6. Warm 3.67 52 01 06 07 m 
7. Spirited 341 08 42 13 20. 08 
8. Lively 3.53 03 46 06 35 —10 
9. Enthusiastic 3.66 08 31 26 05 02 
10. Cheerful 3.58 21 13 28 25 | —12 
11. Vigorous 345 04 51 —08 05 @ 
12. Quick — 3.32 п = — 13 55 09 
13. Energetic 344 —06 63 —08 09 03 
14. Brisk 2.95 —07 19 — —10 36 06 
15. Active 3.72 13 51 —05 —01 —09 
16. Wholesome 3.66 02 22 43 04 06 
17. Sprightly 3.22 —09 31 13 39 00 
18. Healthy 4.03 —18 25 31 20 04 
19. Dynamic 3.15 —03 52 07 08 06 
20. Agile 3.37 02 22 06 37 —05 
21. Fluent 2.89 00  —07  —12 13 52 
22. Facile | 3.09 02 06 —02 35 13 
23. Evocative 3.16 18 —05 05 02 41 
24. Depictive 2.56 —16 01 00 03 50 
25. Articulate 2.92 —16 04 11 —03 59 
26. Persuasive 3.43 22 06 —02 07 20 
27. Felicitous 3.24 36 07 00 21 05 
28. Eloquent. 2.78 01 12 00 —02 45 
29. Communicative 3.39 37 0+ —07 12 14 
30. Tactful 3.13 —02 —32 16 18 32 
31. Responsive 3.46 36 08 16 —21 13 
32. Leading | 2.95 —07 10 —04 27 26 
33. Cooperative 3.67 05 — —01 53 02 = 
34. Attentive 3.46 02 —13 56 —06 08 
35. Tolerant 3.62 22 —05 44 —13 201 
36. Sociable 3.65 53 00 00 ip — 09 
37. Responsible 3.79 3 —09 06 46 —03 00 
38. Empathetic 3.26 97 16 —0% 21 —24 Ы 
39. Congenial 3.72 84 53 — —05 18 10 —0 
os ees 18 20 20 18 323 


7 Note: Initials h.c. stand for hyperplane count. Decimal points omitted. 
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following five-point scale: (а) least, (b) less than moderately, (c) moder- 
ately, (d) more than moderately, (e) most. The self-ratings constituted the 
scores of Ss on each of the 39 items. All the adjectives were defined and col- 
lected in a separate booklet entitled Adjective Definitions. 

Procedure. The GAAL and the Adjective Definitions booklet were 
given to Ss at the same time. Ss were asked to consult the Adjective Defini- 
tions booklet when they were unsure about the meaning of certain adjectives. 
The same examiner administered the GAAL to both male and female groups 
on two different occasions. The findings reported here are based on the com- 


bined groups (V = 202). 


с. 


2. Results 


3 The computations for this study were carried out on 


a. Computations. 
puter Center and with the P- 


the IBM 7094 operated by the Princeton Com 
Stat computer program. 
The iterative factor anal 
used to perform a factor analy 
tine cycles on the number of signi 


of diagonal estimates. The obtaine 
sponds to the significant roots. "Table А shows the matrix of item correlations. 


Three orthogonal rotations of the initial matrix of significant factors were 
performed with the use of the quartimax (4), the varimax (2), and the 
equamax (5), procedures. "Table B shows the matrix of the initial significant 
factors, and of the quartimax, varimax, and equamax factors. Three promax 
rotations (1), with K = 4, were performed on each of the three orthogonal 
rotations obtaining the promax-quartimax, the promax-varimax, and the pro- 
Table C shows the matrix of the promax-quartimax 
ors, and Table D shows their respective correlations. 
andard deviations of the GAAL items, and 
combined sample. Table 2 shows the cor- 
Table B and C, of the NAPS doc- 


ysis routine of the P-Stat computer program was 
is on the matrix of item correlations. The rou- 
ficant roots and then iterates on the stability 
d initial matrix of significant factors corre- 


max-equamax solutions. 
and promax-varimax fact 
Table 1 shows the means and st 


amax factors for the 


the promax-equ 
factors. 


relations of the promax-equamax 


i i sith the National Auxiliary Publi- 
ET vi terials have been deposited wit y 
vag TE FONE TI A, matris of item correlations (39 X 39) of the GAAL; Ta- 
ble B, matrix of initial significant factors, and of quartimax, varimax, and equamax 
factors; Table C, matrix of promax-quartimax and promax-varimax factors; aad 
Table D, matrix of correlations of promax-quartimax and promax-varimax factors. 
Order NAPS Document No. 00419 from National Auxiliary Publication Service, c/o 
CCM Information Sciences, Inc., 22 West 34th St, New York, New York, 10001; re- 
mitting in advance $3.00 for photocopies or $1.00 for microfiche. 

3 The data matrices of the male (№, = 100) and female (Ху = 102) Ss on the 39 
items (M) of the GAAL were combined into an N, + Nz by M table to БЕДЕ 
M x M correlation matrix, the mean scores, and the standard deviations. 
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ument, and Table 1, of the present study, show that the total .10 hyperplane 
count of the promax solutions is higher than that of the orthogonal solutions. 
However, since the promax-equamax solution has the highest total .10 hyper- 
plane count, it was used to interpret the factors. 


TABLE 2 
CORRELATION MATRIX OF PROMAX-EQUAMAX FACTORS FOR SCORES 
ON THE GENERAL ADAPTABILITY ADJECTIVE List (GAAL) 
(Combined Samples (N — 202)) 


Promax-Equamax factors 


Factors A B c D 
Sociability (A) 
Energy (B) 42* 
Attentiveness (C) 46* 36* 
Quickness (D) 30* 40* 33* 
Articulateness (E) 48* 38* 26* 26* 


Note: Decimal points omitted. 
* p—.01. 


b. Interpretation of factors. Interpretation of factors rests on loadings of 
-25 or greater (p = .01 if N = 202). A factor whenever possible was as- 
signed the name of the variable that has the highest loading with it. 

Factor A (Sociability) has loadings arranged as follows: sociable, .53; 
congenial, .53; warm, .52; amorous, .49; resilient, .39; expansive, .38; com- 
municative, .37 ; responsive, .36; felicitous, .36; cordial, .35; and spontaneous, 
«25. This factor refers to Ss’ sociability or disposition to enjoy the company of 
others. 

Factor B (Energy) has loadings arranged as follows: energetic, .63; dy- 
namic, .52; vigorous, .51; active, .51; lively, .46; spirited, .42; tactful, —.32; 
enthusiastic, .31; sprightly, .31; cordial, —.26; and healthy, .25. This factor 
refers to Ss’ energy level. However, the negative loadings of items tactful and 
cordial suggest that Ss' manifestation of energy detracts from, or has a neg- 
ative relationship with, their being cordial and tactful toward others. 

Factor C (Attentiveness) has loadings arranged as follows: attentive, .56; 
cooperative, .53; responsible, .46; tolerant, .44; wholesome, .43; healthy, .31; 
cordial, .29; cheerful, .28; enthusiastic, .26; and spontaneous, —.25. This 
factor refers to Ss' attentiveness or consideration for others. However, the 
negative loading of item spontaneous indicates that Ss' attentiveness is rather 
controlled and lacks, therefore, a certain degree of spontaneity. . ; 

Factor D (Quickness) has loadings arranged as follows: quick, ps 
sprightly, 39; agile, .37 ; brisk, .36; lively, 3355 facile, .35 ; leading, 27; an 
cheerful, .25. This factor refers to Ss’ quickness, or agility in movement. 


~ 


Б. 
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Factor E (Articulateness) has loadings arranged as follows: articulate, 
.59; fluent, .59; depictive, .50; eloquent, 45; evocative, .41; empathetic, .34; 
tactful, .32; and leading, .26. This factor refers to Ss’ articulateness, or skill 


to express themselves clearly and easily. 


C. AwaLvsis oF Factor VARIANCE 
1. Method 


Computations:* The orthogonal and oblique factors of the GAAL, re- 
ported above, were used in this section to obtain their respective factor scores. 
For this purpose the factor score subroutine of the P-Stat computer program, 
tiple regression method, was used. 'The obtained factor scores 
were then subjected to a one-way classification analysis of variance by using 
another subroutine of the P-Stat computer program. 'Table 3 shows the fac- 
ations, and Ё ratios of the male and female $$ 
on the five factors of the GAAL (sociability, energy, attentiveness, quickness, 


and articulateness), and for six factorial procedures (quartimax, varimax, 
ax-varimax, and promax-equamax). 


based on the mul 
tor score means, standard devi 


equamax, promax-quartimax, prom 
2. Results 
. In all six factorial procedures the means of the 


female Ss are higher than those of the male Ss. However, the F ratios reached 
statistical significance only in the equamax and promax-equamax solutions, 
beyond the .01 level, and in the promax-varimax solution, at the .05 level. 
The significant F ratios indicate that the female Ss rated themselves as being 
more sociable—more disposed to enjoy the company of others—than the male 


Ss. 
hs. Fan B (Energy): In all six factorial procedures, the means of the 
female Ss are higher than those of the male Ss. However, the F ratios reached 
varimax, promax-quartimax, 


statistical significance only in the quartimax, 1 
and promax-varimax solutions, at the .01 level, and in the promax-equamax 


solution, at the .05 level. The significant F ratios indicate that the female Ss 
rated themselves as having a higher energy level than the male Ss. 


c. Factor C (Attentiveness). In the varimax, equamax, promax-quarti- 
and promax-equamax solutions the means of the fe- 


n those of the male Ss; however, in the quartimax 
ale Ss is higher than that of the female Ss. The F 


a. Factor A (Sociability) 


max, promax-varimax, 
male Ss are higher tha 


solution the mean of the m 
= matrix of observed scores 
and F = rotated factor loading 


— factor score matrix, 


$i —1 F; where 5 о i 
ie e Я intercorrelation matrix, 


in standard score form, R = 
matrix, 


* = b. м ~ x‘ 
"(10 = ae 
0-4. 
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ratios reached statistical significance only in the varimax, equamax, promax- 
varimax, and promax-equamax solutions, and beyond the .01 level. The sig- 
nificant F ratios indicate that the female Ss rated themselves as being more 
attentive, or more considerate toward others, than the male Ss. 

d. Factor D (Quickness). In the quartimax, promax-quartimax, and pro- 
max-varimax solutions the means of the female Ss are higher than those of 
the male Ss. In the varimax and equamax solutions the means of the male $$ 


are higher than those of the female Ss, and in the promax-equamax solution 
male Ss is equal to that of the female Ss. The F ratios 
ance only in the quartimax solution, at the .05 


level, and in the promax-quartimax solution, at the .01 level. The significant 
F ratios indicate that the female Ss rated themselves as being quicker, or 


than the male Ss. 
). The mean of the male Ss is higher than 
promax-quartimax, and 


the mean of the 
reached statistical signific 


more agile in movement, 
e. Factor Е (Articulateness 


that of the female Ss in the quartimax, varimax, 
but is lower than the mean of the female Ss in 


amax solutions. Since no solution yielded a sta- 
as found for factor articulate- 


promax-varimax solutions, 
the equamax and promax-equ 
tistically significant F ratio, no sex difference w 


ness. 
D. Discussion AND ConcLUSION 


The correlations of the promax-equamax factors shown in Table 2 range 
from .26 to .48. They are all statistically significant, at the :01 level, and 
indicate that the five factors of the GAAL, sociability, energy; attentiveness, 
quickness, and articulateness 5 n degree of variance, Factor so- 


ciability refers to Ss’ disposition 
ergy refers to Ss’ energy level. Factor 


for others, Factor quickness refers to 85 agilit 
ticulateness refers to Ss’ skill to express themselves clearly and easily. 
the GAAL, shown in Table 3, indicate 


The F ratios of the five factors of the 
that the female Ss rated themselves significantly higher D the male 85 on 
factors sociability, energy, attentiveness, and quickness. No significant sex dif- 
factor articulateness. 
an analysis of factor variance, Pinneau e£ al. (3) have rec- 
ated factor scores; however, in this study both 
res were used. Since the orthogonal so- 
and their corresponding oblique solutions, as another 
on the same four of the five factors of the 
could have been used to find out the sex 


hare a certai 
to enjoy the company of others. Factor en- 


attentiveness refers to Ss’ consideration 
y in movement, and factor ar- 


ference was found for 


In performing 


ommended the use of uncorrel 
uncorrelated and correlated factor sco 
lutions, as a group, 
group, yielded significant F ratios 


GAAL, either group of solutions 
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differences reported here. However, only the promax-varimax and the pro- 
max-equamax, both oblique solutions, yielded significant Ё ratios on three 
different factors: sociability, energy, and attentiveness. Moreover, it was only 
the quartimax and promax-quartimax solutions that yielded significant F ra- 
tios on factor quickness. . 
"The results of Table 3 do not show, unambiguously, which of the factorial 
procedures, either orthogonal or oblique, was most effective in detecting sex 
differences. Moreover, whether the statistical significances reported for four 
of the five factors of the GAAL—sociability, energy, attentiveness, and quick- 
ness—could be attributed to such sources as Ss’ soci 
personal preferences, or to the f 
tained from the results of this 
ried out, using different sampl 
the factorial invariance of th 


al expectations, or their 
actorial procedures used here cannot be ascer- 
exploratory study. Other studies must be car- 
es of male and female Ss, to investigate (а) 


e GAAL; (b) the factorial structure of the 
GAAL under different test instructions; and (c) the influence of different 


orthogonal and oblique rotations on the estimation of factor scores, and on 
the statistical significance of the difference between pairs of factor score means. 


E. SUMMARY 


An iterative factor analysis was performed on the scores obtained from a 
General Adaptability Adjective List (GAAL) for a combined sample of un- 
dergraduate male and female Ss (N = 202). The initial matrix of signifi- 
Cant factors was rotated E to the quartimax, varimax, and equamax 
procedures, The Promax rotation was, then, performed on each of the three 
orthogonal solutions obtaining the Promax-quartimax, promax-varimax, and 
promax-equamax solutions, Interpretation of factors was based on the pro- 


max-equamax solution and the factors obtained were sociability, energy, at- 
tentiveness, quickness, and articulateness, 


A simple analysis of factor variance of the factor score means of male and 
female Ss, on the five factors of GAAL mentioned above, yielded the follow- 
ing results. The female Ss rated themselves significantly higher than the male 
5s on factors sociability, energy, attentiveness, and quickness; no significant 
sex difference was found for factor articulateness, Factor scores were esti- 


mated by the multiple regression method and from the quartimax, varimax, 
equamax, promax-quartimax, Promax-varimax, and prom 
dures. 


accordin 


ax-equamax proce- 
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: THE FEAR OF DEATH AND THE FEAR OF DYING*! 


Department of Psychology, Wellesley College and 
Suicide-Prevention and Crisis Service, Buffalo 


с 


Lora-JEAN COLLETT AND Davip LESTER 


— 


A. INTRODUCTION 


Investigators of attitudes tow 
dardized measures of the fear of de 
(1), and Lester (3) have devise 
gated the reliability and validity of their me 


their scales cover a wide variety 


als, and even worry over the death toll on highways. 


aversion to funer: 
The purpose of the pres 
fears, attempting to distin 


the process of dying and to 
whether they are for oneself or for another. 


guish between the fe 


B. METHOD 


The Ss formed an original sample (A) and 
consisting of 25 female undergraduates. 


Thirty-eight statements were compos! 1 
death of self, fear of death of others, fear of dying of self, 


of others. The number of items in eac 


tively.? О 
The items were given to the Ss who were required to indi 
Е ment or disagreement with each item on а 6-poin 


agreement (4-3) to strong disagreement (—3)- 
C. RESULTS 


The scores of the Ss in sample А on each item w 
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ard death have realized the need for stan- 
ath (2). Sarnoff and Corwin (4), Boyar 
d measures of the fear of death and investi- 
asures. However, the items in 
of themes: fear of dying, fear of being dead, 


ent study was to devise separate measures of death 
ar of death from the fear of 


differentiate between these fears depending upon 


a replication sample (B) each 


ed concerning the four fears: fear of 
and fear of dying 


h subscale was 10, 10, 8, and 10 respec- 


cate their agree- 
t scale ranging from strong 


ere correlated with the 
wn, Massachusetts, on April 9, 1969, 
Worcester, Massachusetts. Copyright 
Suicide-Prevention and Crisis Service, 


The National Auxiliary Publications 
] Auxiliary Publica- 
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total score for the subscale to which the item belonged. All items whose cor- 


relations were not significant at the .10 level of significance (i.e., the corre- 
lation was less than 26) were eliminated. Six items were eliminated as a 
result.? 


The intercorrelations between the four subsc 
ranged from .03 to .58 and were 
of fear and the referent of the f 

` self and fear of death of others h 
two samples), whereas fear of d 
had a significant but low correla 
tailed р < .05)4 


A three-way analysis of variance for Tepeated measures was carried out on 
the data (5) with the two samples treated as a replication factor.” The 


terms of the analysis involving the replication factor were all nonsignificant. 
The largest F-ratio was 1.16 (df = 1,48). 


The Ss showed a significantly highe 
46.53; df 


when the 


ales were calculated and these 
, in general, low especially where the type 
ear differed. For example, fear of dying of 
ad a zero correlation (.03 and —.07 in the 
ying of others and fear of death of others 
tion (.46 and .40 in the two samples, two- 


r fear of death than of dying (F = 
= 148;5— :01). "They also showed à significantly greater fear 
Self was the referent than when another was the referent (F = 
14.74; df = 1,48; 5 < 01). The interaction of death/dying with self/other 
Was also significant (F = 23:78. df = 148; < 01 ). The mean scores per 
item for each of the subscales (a measure used to control for the length of 


each scale) were fear of death of self, .48; fear of death of others, 59; fear 
of dying of self, .33; and fear of dying of others —.78. 


D. Discussion 
The low intercorrelations b 


fear of dying of self, fe 
indicate the potenti. 
rather than indiscri 
would lead to an 
The low correlatio 


etween the four fears (fear of death of self, 
ar of death of others, and fear of dying of others) 
al usefulness of differentiating these four specific fears 
minately grouping all items in the same scale. Such a step 
improvement in available measures of the fear of death. 
ns may reflect a general death fear which is tapped by all 
death scales, and in cases where the scales share а common process (death/ 
dying) or a common referent (self/other) they may reflect this shared object. 


7 The test used Was a one-tailed test of significance, 

4 The intercorrelations for the two samples were death НЫ уна. E 
26, death self/dying self 24 and 41, death self/dying others .09 and .58, death 
others/dying self .03 and —.07, 


death others/dying others 46 and .40, and dying 
self/dying others .28 and 40. A correlati is necessary for signifi- 


cance at the .05 level (two-tailed test). 


5 The interaction terms in analysis of Variance were homogeneous (5). 
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ying of others was least feared. In fact for this 
subscale the mean score was negative, indicating that Ss faced this topic in 
ather than avoiding it. The differences in scores on the four scales 
are not a result of the different lengths of the four scales, since this was con- 
trolled for in the analysis of variance by using the mean score per item for 
each scale and each $. However, the differences could be a result of differ- 
ences in the strength of the wording of the items of different scales. Such an 
occurrence would not be of importance when other variables are being com- 


pared to scores on the four scales. 
E. SUMMARY 


ar of death of self, fear of dying of self, fear of death of 
others were devised. The low correlations be- 
dicate the potential usefulness of differentiating 


Of the four fears, fear of d 


general г 


Scales to assess fe 
others, and fear of dying of 
tween scores on these scales in 


between these four fears. 
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IN HUTTERITE COMMUNAL SOCIETY*? 


Department of Psychology, University of Manitoba 


SHIRIN SCHLUDERMANN AND EDUARD SCHLUDERMANN 


A. INTRODUCTION 


The Hutterite society is four hundred years old. The three main tenets of 
the society are (а) Anabaptist ideology; (b) collectivist ideology; and (c) 
the principle of realization: i.e., commitment to translating ideology into ac- 
tion. Like other communal societies—e.g., the Kibbutzim—the Hutterite so- 
cieties are agricultural communities. Hutterites firmly believe in the principle 
and practice of sharing everything equally according to needs. In the Hutter- 
ite societies cooperation, equality, fraternity, justice, freedom from competi- 
are some of the basic principles espoused. In keeping 
men and women wear distinctive uniform 
approximately 100 members, live in 


g houses built by themselves, eat in a common dining 
one else, and abolish all visible signs of status 
symbol usually associated with rank, role, and position in society. Interested 
readers may read books by Victor Peters (6), John Horsch (3) and Paul 
Gross (2) to gain interesting and relevant information on social structure 
and organization, economic life, and religious beliefs of the Hutterites. At 
present, Hutterite colonies are located in Alberta, Manitoba, Saskatchewan, 


South Dakota, and Montana. MEN 5 | 
The child rearing practices in the communal societies involve “collective 


education.” The responsibility for care, training, and discipline of children 
becomes the moral, economic, and ideological responsibility of the commu- 
nity. The specially trained members of the society assume much of the pa- 
rental responsibility for educating the children. Collective upbringing in the 
Hutterite society has the following features: | 

During the first six weeks of his life, the baby is looked after by a female 


Provincetown, Massachusetts, on April 9, 1969, 
ew Street, Worcester, Massachusetts. Copyright 


tion and exploitation 
with their ideology, the Hutterite 
dress, dwell in colonies consisting of 


similar type dwellin 
room the same food as every 


ial Office, 
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member of the community. Thereafter the mother takes full charge of 
baby. The first three years of the child's life are spent in the special care о 
his mother supplemented by abundant care by other close relatives and com- 
munity members. Children attend the kindergarten between the ages of three 
to six years. А special kindergarten woman (mostly a grandmother) is in 
charge of these children for most of the day. 
years, children attend the public school which is on the colony. The school 
day is supplemented by one hour of instruction in German and the Bible m 
the morning and evening, The German school teacher supervises and disci- 
plines the school-age children. The colony leaders—i.e., the minister, the busi- 
` ness manager, the German teacher, the kitchen woman, etc.—are also jointly 
responsible for the proper conduct and discipline of the child. After the age 
of 16 boys and girls leave school and take their sex-appropriate adult roles in 
the Hutterite colony. Since the colonies are of manageable size, face to face 
interaction and constant supervision of colony members are possible. 

It was one of the aims of this investigation to examine the suggested per- 
ceptual regularities and universalities in interperson perception as mentioned 
by Dubin and Dubin (1) and others. It will be of interest to examine if 
these perceptual constancies in varied cultural contexts ch 


or quantitatively; and if only the latter, what might be t 
quantitative changes as estim 


Between the ages of six to 16 


ange qualitatively 
he extent of such 
ated by psychological instruments (i.e., Seman- 
and universal are the trends observed in social 
ption in a contrasting cultural milieu? The sec- 

а and number of the 
basic dimensions underlying 


le by eight to 13-year-old 
boys and girls. How do the аре and sex influence the children's 


B. METHOD 


І. Subjects 

Three hundred and fifty- ere selected from 20 Hutterite 

colonies in Manitoba. to 13-year-old boys and girls cur- 

rently enrolled in the public schools in Hutterite colonies. The Ss were se- 
lected on the basis of their school i 


ilities from average to somewhat 
above-average. In order to control somew! 
ferences in intelligence on conceptualizati 
dents were eliminated from the sample. S, 
onies form a homogenous social group lo 


9n, very dull and outstanding stu- 


ocioeconomically, the Hutterite col- 
Cated in rura] areas, 


a> 
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2. The Instrument, Concepts, and Scales 


Osgood and Osgood eż al. (4, 5) have reported a series of cross-cultural 
studies with a variety of semantic scales, and have also rated a great variety 
of concepts. The factor structure of the semantic space has consistently re- 
mained stable across the varied language-culture groups studied. It appeared 
then that a well-standardized and tested tool in a great variety of cross-cul- 
tural and cross-linguistic groups would be an ideal tool for cross-cultural 


studies. 
The social role concepts that 


scales were as follows: 
1. Family roles (males) : Grandfathers, fathers, brothers. 


2. Family roles (females) : Grandmothers, mothers, sisters. 
3. Occupations (males) : Farm foremen, pigmen, doctors. 
4. Occupations (females) : Kitchen women, garden women, kindergarten 


were rated on a set of 15 bipolar semantic 


teachers. 
5. Self-concepts: Me (as I am), Me (as I would like to be). 


6. Friends: Boy-friends, girl-friends. 
The set of 15 bipolar semantic scales consisted of weak-strong, important- 


ast, interesting-boring, afraid-brave, busy-lazy, dull-bright, 
soft-hard, unfriendly-friendly, gentle-rough, dirty- 
clean, active-quiet, small-large. The selection of scales was based on relevance, 
simplicity, representational character, and empirical variance. 'The scales were 
1 ive empirical tests before the final selection was 


subjected to a series of success! befo 
made. Care was taken to alternate the polarity direction of the semantic scales. 


Children rated the concepts on a five-point scale. 

3. Investigative Procedure 
rential was administered in small groups for the differ- 
The procedure of administration, the forms of in- 
tion of materials (Form II) were identical for 
actly the same as those described by Osgood 


common, slow-f 
useful-useless, sad-pleasant, 


The semantic diffe 
ent age-groups separately. 
struction, and the presenta 
all age groups and were kept ex 


et al. (5). 
d C. RESULTS 


jects were subdivided into two age subgroups 


'The male and the female sub. r i 
ars to 13 years. Correlation matrices 


(a) cight years to ten years, (b) 11 ye 
of 15 X 15 scales for the four age-sex subgroups were calculated separately, 


and factor analyzed by the principal axes method. The axes were rotated 


orthogonally according to the Varimax criterion. 
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Table 1 indicates that only the first factor in each subgroup accounts for 
a sufficiently large percentage of the variance to be considered as a ics 
factor. All other factors in the groups are only specific factors. The principa 
factor matrix showed that 13 out of 15 scales had substantial loadings "e 
greater than .40) on the first factors. There are only two or three scales wit 
any substantial loadings on the second factors and only one or two scales on 
the third factors. In all subgroups, even though the first factor can be called 
a common factor, it does not account for more than one-third of the total 
variance; therefore, the factor space cannot be interpreted to be a unidimen- 


А 9 2 asts 
sional space. The failure to obtain one or two or more common factors cast: 
doubt on the initial assumption that a few b 
13-year-old children's perception of social ro 


view of the results obtained in previous stu 
as to wl 


asic dimensions underlie eight to 
les in the Hutterite colonies. In 
dies (5, 4, 7), one may question 
as emerged. The finding is all the 
the investigators in their previous 
cales and the same social roles as 
and have obtained a stable semantic space (i.e., Eval- 
and Activity) as reported by Osgood (5, 4). 


hy no stable common factor space h 
more challenging in view of the fact that 
study (7), have used the same semantic s 
were used in this study, 
uative, Potency, 


TABLE 1 


PROPORTION oF VARIANCE ACCOUNTED FOR BEFORE ROTATION 


Cumulative 
Subgroups 


percent 

Ages Fi y Е; Е, уагіапсе 
Young girls 8-10 yrs. 28.33 10.02 7.07 6.08 51.99 
Young boys 8-10 yrs, 29.80 10.07 7.50 6.00 53.38 
Older girls 11-13 yrs, 33.04 10.54 6.92 6.22 56.73 
Older boys 11-13 yrs, 33.40 11.61 6.95 5.90 57.86 


D. Discussion 
l. Factor Structure 
As is seen from Table 1, the first three factors accounted for less than 50 
percent of the total variance in each subgroup. This low common factor vari- 
ance reflects generally low intercorrelations between the 15 semantic scales 
and could be accounted for in different ways: Firstly, there is the possibility 
that the subjects failed to understand 


the instructions clearly. Secondly, the 
Ss may have had a somewhat vague u 


nderstanding of the descriptive adjec- 
tives and they may have been uncertain in t 


he use of these adjectives in the 
description of people. During the administration of the SD, special steps were 
taken to eliminate, as far as possible, these two sources of error. Peculiarities 


^» 
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of the social environment in which Hutterite children grow up provide a very 
important third source of influence which would lower the common factor 
variance. Hutterite children of the age range studied usually have intimate 
contact with only one individual who fills a given social role in their own 
colony (e.g., farm foreman). It is therefore quite possible that the personal 
characteristics of the individual who holds a given office on their own colony 
influences the children's ratings of general social roles. 

In the principal factor matrix of all the four subgroups, the first factor 
appeared as a global undifferentiated perception of the various social roles. 
Very likely the young respondents looked at the bipolar adjectival semantic 
scales in an essentially good-bad dimension. It is good to be brave, strong, in- 
teresting, useful, clean, etc. If persons possess these qualities they are “good,” 
if not, then they are "bad." The next two factors (which can be regarded 
as specific factors), show interesting trends. The semantic scales "strong," 
"hard," and "rough" which compose F» in all four subgroups, appear repeat- 
(P) dimension in the cross-cultural researches of Osgood 
utterite children show some observable 
trends suggesting the early stages in the development toward a clear-cut E 
(Evaluative) P (Potency) A (Activity) factor structure. It is likely that, 

grow older, they may be able to make 


as the young respondents of this study у l 
the kind of differentiation in their perceptions that researchers using the se- 


mantic differential are accustomed to obtaining. 

2. Social Perception 
It may be said that the greater the number of common factors used by ü 
describe social roles, the greater will be the cognitive 
When the results of this study are compared with 
the results of studies (using the same social roles and the same semantic 
scales) on American children of the same age range аа 7) the HE n 
ception of Hutterite children appears to be quite undi ong Б п this 
section an attempt will be made to suggest some characteristics o utterite 
society which would delay the development of differentiated social perception 


i ildren. 
ine Te Lat considerable difficulty in noticing differences in so- 
cial roles, because in Hutterite society all visible status symbols (which are 
the most obvious cues available to American children in the perception of so- 
cial differences) are deliberately abolished. Hutterite society does not reward 
high achievement by way of distinguishable high status position. шее 
ideology deliberately discourages the tendency to look at certain social roles, 


edly as the Potency 
(4). The results obtained from the H 


group of respondents to 
differentiation involved. 
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and certain types of work as superior or inferior. Since attaching greater 
value to certain social roles over others does not receive any overt positive re- 
inforcement, Hutterite children have few incentives to pay special attention to 
characteristic features of different social roles. It is suggested that the devel- 
opment of differentiated social perception among Hutterite children is de- 
layed because Hutterite children have fewer observable cues for role differ- 


entiation and because they have less encouragement for learning the differences 
between social roles. 


E. SUMMARY 


Three hundred and fifty-nine Ss, eight to 13 years old, selected from 20 
Hutterite colonies in Manitoba, were asked to describe 16 social roles on 15 
semantic differential scales. Factor analyses of the 15 X 15 four subgroups 


correlation matrices yielded only one common factor which accounted for 
only one-third of the total variance; the rest of the factors emerged as spe- 


cific variances. No clear-cut indication of a stable EPA semantic space was 
obtained for any of the age-sex groups. The assumption of a few basic dimen- 


sions underlying Hutterite children's perception of social roles became doubt- 
ful. 
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PEER INFLUENCES ON PRESCHOOL CHILDREN’S 
WILLINGNESS TO TRY DIFFICULT TASKS** 


Department of Educational Psychology, University of Georgia 


E. PAUL TORRANCE 


A. INTRODUCTION 

fail to learn because they are unwilling to try 
Others fail because they compulsively insist 
r them to master. Thus, to guide a child's 
t be able to influence his willingness to try 


Some children apparently 
more and more difficult tasks. 
upon trying tasks too difficult fo 
learning effectively a teacher mus 


tasks of appropriate difficulty. 
In another source, the author (4, 5) has presented evidence to indicate 


that learners differing in tendencies to attempt the difficult need different 
kinds of feedback concerning their performance. Learners low in willingness 
to attempt the difficult appear to need evaluative feedback and external cri- 
teria. Those high in willingness to attempt the difficult seem to be hampered 
by this kind of feedback and need opportunities to test their own limits and 
push their limits forward. Thus, teachers should be aware of individual dif- 
ferences in willingness to attempt the difficult and to differentiate feedback 
experiences accordingly. 

Almost all personality studies of productive, creative people show that 
such people enjoy а calculated risk in which success or failure depends upon 
their own ability (6). In fact, there seems to be a positive relationship be- 
tween willingness to attempt difficult tasks and achievement. Studies of the 
lives of eminent people who have made important social contributions sug- 
gest that one of the essentials for high achievement is a strong tendency to 
attempt difficult tasks. Instead of just adapting ог adjusting to their environ- 
ment, they deliberately exposed themselves to possible failure and committed 
themselves to goals that required sustained expenditures of intellectual and 


physical energy and continued changes in behavior. Thus, it would appear 
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that it is important to society that schools develop in children the ability to 
estimate their limits and a willingness to attempt difficult tasks. 

The present study was conducted as a part of a sequence of activities to 
teach children to estimate more accurately their limits. It was designed spe- 
cifically to determine the influence of threc different peer conditions on will- 
ingness to attempt difficult tasks. The design of the study was suggested by 
the author's observations in working with United States Air Force personnel 
in advanced survival training that men were more likely to attempt difficult 
tasks and to test their limits in pairs than in individual or large group situa- 
tions. The design of this study was also influenced by the finding of Miller, 
Bensen, Seidman, and Meeland (2) that men have a greater tolerance of 
electric shock in the presence of a partner than when alone. 


It was hypothesized that 
would be more w 


when alone or bef 


the five-year-old children in the present study 
illing to attempt the difficult when placed in pairs than 
ore a large group (a class of 22 peers). 


B. PROCEDURE 


1. Subjects 


The subjects were 44 five-year-olds enrolled in an experimental prepri- 


mary educational program following the Creative-Aesthetic approach (7, 8). 
The subjects were equally divided into two classes each taught by a master 
teacher assisted by д part-time teaching aide. "The classes were integrated and 
roughly represented д stratified sample in a medium-sized Southern town. А 
deliberate attempt had been made to include some disadvantaged children in 
each class. While the majority of children in the two classes studied are 
white, almost all of the remaining children in the school are black. 


2. The Setting 

Essentially the same behavior: 
rooms. These behavioral settings 
perimentation, manipulation 


al settings were maintained in both class- 
were deliberately d 
› sociodr: 
expression. The experimental materi 


esigned to encourage ex- 
amatic play, Construction, and creative 


1 als (bean bags and waste baskets) used 
in this study had been available in both classrooms from the beginning of the 


school year but had been used almost entirely during free activity periods. 
The author was accustomed to working with each class for about one hour 
each week, Early in the school year during a free activity period, the author 
had encouraged some of the children to try to throw the bean bags into the 
basket from different distances. No systematic work had been done either to 
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teach the children to estimate their limits or to encourage them to try the 
task at distances difficult for them to succeed. Any such learning prior to the 


present study was incidental. 
3. Experimental Material 


'The training and test task used in this study was adapted from Stark- 
weather's (3) test of willingness to try difficult tasks. Starkweather’s test 
apparatus consists of a target game in which a ball is rolled from varying 
distances to a target enclosed in a box. A marker consisting of a strip of 
cloth calibrated for distance is used in determining what is difficult for each 
child and in setting the easy and difficult task. Prior to the development of 
this instrument, Starkweather (3) had used a bean bag target game as a test 
of willingness to try difficult tasks but had abandoned н. 

In the present study, the experimental equipment consisted of two sets of 
three 12-ounce bean bags each, a marker designating five levels of difficulty, 
and a waste basket. The equipment used in the study was quite similar to that 
hich had been available in the classrooms from the beginning of the school 


w 3 > beg 
d been used prior to this time. 


year. However, no marker ha 


4. Orientation to the Task 


The target game was introduced as a part of the author’s usual work with 
the children and was preceded by the reading and discussion of Mary Mc- 
Burney Green's Is It Hard? Is It Easy? (1). As the book was read, the au- 


i s ү it was easy for each of them to 
hor : Idren talked about whether it was easy 
thor and the childrer catch and throw a ball, take a bath, hold tickly 


skip, tie shoelaces, climb trees, ] i b 
A i t, and pound nails so they go in straight. They were 


worms, whistle, somersaul i 
told that they could make the bean bag target game easy or difficult by stand- 


ing close or far away from the target. In all three conditions, children were 
given opportunities to practice the task to “warm up” and to find out what 


was hard and easy for them. 


5. Peer Conditions 


Under all three conditions, each subject was given a practice session, an 
ex саба of the scoring system (5 points for hitting the target from E, 4 
a ts for hitting it from D, 3 points for hitting it from C, etc.), and three 

n 


ia it the target. a 

к> oe ditions А and B, children were called from the classroom in ran- 
ndi a : : 2 

1 d to play the target game in the wide hallway just outside the class- 

ee and markers were set up in such a way that the 


room. Two sets of targets 
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players were back to back and could not see what the other was doing. Un- 
der Condition A, the children practiced and played the game in pairs; under 
Condition B, they practiced and played alone. Under Condition C, they per- 
formed before the entire class and each class competed with the other for the 
ownership of a large ball unlike any available to them. After each child had 
performed, he was allowed to add whatever points he scored to the class score 
being kept on the blackboard. (The two classes achieved identical scores.) 
This procedure netted 22 subjects each in Conditions A and B and 44 sub- 
jects in Condition C. The 44 children in Condition C were the same ones 
who were in Conditions A and B. The study was conducted after the chil- 
dren had been in the preprimary program for about five months. 


C. RESULTS 


A rough test of the hypothesis already stated can be obtained by comparing 
the proportion of easiest tasks chosen under each condition. These percentages 
were as follows: Condition A (Pairs), 13.7 percent; Condition В (Alone), 
36.5 percent; Condition C (Class), 86.5 percent. In all cases, the differences 
in proportions are statistically significant at the one percent level of confidence. 
When working in pairs, children least frequently tried the target from the 
easiest position. When performing before the class, they were most conserva- 
tive and most frequently tried the target from the easiest position. 

A somewhat more precise test can be made by obtaining a distance score 
for trial, adding the distance score for the three trials by each child, and 
comparing the differences in means for the three conditions. The means and 
standard deviations, respectively, for the three conditions are as follows: 
Condition A (Pairs) 12.1, 3.3; Condition B (Alone), 6.7, 3.9; Condition 
C (Class), 3.8, 1.6. An analysis of variance yielded an F ratio of 64.64, sig- 
nificant at better than the .01 level. A ż ratio of 4.61 was obtained for the 
difference in means for Conditions A and B and one of 1.91 was obtained for 
the difference between Conditions B and C, the former being significant at 
better than the .01 level and the latter being significant at about the .05 level. 


D. Discussion 


These results suggest that young children tend to be most willing to try 
difücult tasks when working in pairs and least willing to do so when per- 
forming before the entire class. In individual situations, they seem to bs вак 
to adjust the difficulty of the task to their skill level than under the "patr 


and "class" conditions. 
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One might question the extent to which one can generalize from the lim- 
ited number of subjects included in this study. One might also wonder if the 
Creative-Aesthetic orientation of the program might not have influenced the 
results; Under the Creative-Aesthetic approach, there is constant emphasis 
on attempting difficult tasks and much effort is expended in developing a 
psychologically safe environment. "Thus, it would be expected that children 
under the Creative-Aesthetic approach would feel freer than under other 
conditions to make errors or fail in the presence of the group. In both classes, 
however, the children were extremely reluctant to try the difficult before the 
class. Confidence in the results is strengthened by the fact that they were 
quite consistent within treatments and the differences between treatments 
were quite strong. 

From these results, it would be expected that having children work on 
school tasks in pairs would influence them to try difficult tasks, while requir- 
ing them to perform before the entire class would discourage them from do- 
ing so. In the latter condition, none of the children in either class tried to 
hit the target from the five-point line and only two of them attempted to hit 


or three-point line. When working in pairs, only three of 


it from the four- 1 pa 
he easiest position; most of them 


the 22 children tried to hit the target from t 
held doggedly to the four- and five-point lines. 
E. SUMMARY 


children have to be willing to attempt dif- 
and growth, teachers should be able to 
e to attempting tasks of appro- 


In order to learn and to grow, 
ficult tasks. To facilitate learning c 
create the social or peer conditions conduciv 


priate difficulty. А ied llores tin us aogier 
уа /pothesized that five-year-old children ап ex} а А 
рн ya illing to attempt difficult tasks when 


mary school program would be more w . t 
н re their entire class. 


igi" fo 
laced in pairs than when alone or be 1 | 
: The a me in this study was a target game m which the children at- 


tempted to throw bean bags into a basket from distances that pe 5 task 
either easy or difficult, depending upon the choice of the child. In “a ү 
A, the children played the game in pairs; in Condition B, alone; A ү: 
Condition C, before the entire class. There were 22 children each in Condi- 


i 4 in Condition C. 
id Бшш the hypothesis and further suggested that 


Its clearly supported € s 
PE idu least willing to attempt the difficult when performing before 


the entire class. 
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THE USE OF MATRIX RATHER THAN SERIAL 
PRESENTATIONS OF NONSENSE SYLLABLES* 


University of Windsor, Ontario 


ALAN KRICHEV AND BRENTON HAZLEWOOD 


A. INTRODUCTION 
e rote memorization of nonsense syllables or 


similar material have presented such material in a sequential (serial) list 
form. This helps to organize the material, allows the formation of associa- 
tions between items and is the method typically employed when one is faced 
with the task of memorizing written material. 

However, for a given experiment such a presentation can have drawbacks. 
From a practical standpoint, it can be difficult to present a long list, especially 
by the use of a slide or overhead projector to groups of Ss. In addition, there 
are two disadvantages which may influence the results: (@) when a consider- 
able amount of low-meaningfulness material needs to be learned, the monot- 
ony of the task may influence the results, and (^) it isa task which is familiar 
to most Ss and, therefore, may cause them to lose interest quickly. 

Such experimenters as Anderson and Ross (1), Hebb (2), and Woodworth 
(3) have presented material to be memorized in a matrix form (e.g, five rows 
of five items cach). This method has the advantage of saving space and of be- 
ing different from the usual method of organizing material to be memorized. 
At the same time, it could be used to vary the task when considerable mem- 


orization is needed. | 

It was the purpose of the present paper to compare the matrix with the 
sequential list presentation methods to ascertain whether the methods could 
be used interchangeably. 


Most experiments requiring th 


B. METHOD 
1. Subjects 


versity of Windsor undergraduates enrolled in an in- 
There were 32 men and 28 women. Eight men 
assigned to each of four groups: serial list- 


The Ss were 60 Uni 
troductory psychology course. 


and seven women were randomly 
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position (SP), serial list-random (SR), matrix-position (MP), and matrix- 
random (MR). 


2. Procedure 


Testing was done on the Ss in groups of five, all of whom were from the 
same experimental group. They were brought into a classroom which had a 
movie screen in the front and were seated 15 fect from the screen and facing 
it. Each group of five was shown nine CVC trigrams on the screen. Groups 
SP and SR were shown the trigrams arranged in a serial list in the standard 
fashion, although the list was unnumbered ; Groups MP and MR were shown 
them arranged in a 3 X 3 matrix. Each $ was given 15 prenumbered answer 
sheets with either nine lines arranged vertically or a blank matrix. 

Groups SP and MP were told to write down each trigram they could re- 
member in exactly the same Position or square as it appeared on the screen. 
Groups SR and MR were told to write down each trigram they could re- 
member in any position or square, regardless of where it appeared on the 
screen, 


Ss received a total of 15 trials each. Each trial consisted of a five-second 
Presentation period and a 30-second recall period. There were five seconds 
between trials to allow the Ss to turn over their answer sheet so they would 
have no knowledge of results readily available. Ss repeated the procedure for 


all 15 trials, even if they recalled all nine trigrams correctly on any given 
trial. 


C. RESULTS 


Table 1 presents the percentage of correct responses for each group at the 
end of each block of three trials. Analysis of variance indicated that neither 
the type of group (F — 2.24; df = 3; p > .05) nor the group by trial in- 
teraction (F = 1.07; df = 12; р > :05) had a significant effect on the 
amount of material learned. The sex of the Ss and whether they had to re- 
spond by exact position also had no effect. 


TABLE 1 
NUMBER OF CORRECT RESPONSES PER BLOCK or THREE TRIALS 

Trials Total % 

Group 1-3 4-6 7-9 10-12 13-15 correct correct 
5Р 123 251 323 370 391 1458 72 
SR 120 211 264 315 322 1232 61 
MP 130 279 343 376 386 1514 75 
MR 134 256 318 350 367 1425 70 
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D. ConcLusion 


The present results indicate that there is no essential difference between 
the matrix and the sequential list methods of stimulus presentation. Hence, 
the choice of the matrix form is indicated when the technical issues of limited 
presentation space or task variability require a different or more compact form. 
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INTERSTRESSOR CONSISTENCY OF SUBJECT 
DIFFERENCES IN AUTONOMIC ACTIVITY* : 


U.S. Army Rescarch Institute of Environmental Medicine, Natick, Massachusetts 


Gary W. К. PATTON 


А. INTRODUCTION 


_The purpose of this paper was to determine whether there are consistent 
differences among Ss in the character of their autonomic nervous system 
(ANS) response to varied thermal and psychological stresses. The prediction 


was made that, regardless of the exact ANS response pattern elicited by the 
stress (2), (a) Ss would respond consistently; that is, an $ would exhibit a 
characteristic amount of sympathetic nervous system (SNS) activity in a par- 
ticular measure under a varicty of stresses, and (^) individuals classed as 
sympathetic (12) under rest conditions would display relatively more SNS 
activity in all autonomic measures during each stress than Ss characterized as 
parasympathetic. In terms of measures employed here, greater SNS activity 


was indicated by increasing pulse rates, systolic blood pressures, palmar skin 


conductances, and decreasing instep skin temperatures (5). 


Lacey and Lacey (7) showed that consistent intersubject differences occur 
vever, they did not find that one S consistently ex- 


r of simultaneously-recorded SNS measures under 
ast, Jones (3) and Jost, Ruilmann, Hill, 
such as hypertensives 


for any given measure; hov 
ceeded another in a numbe 


varying stress conditions. In contr 


and Gulo (4) demonstrated that certain patient groups, 
consistently display more SNS activity than normal Ss in a 


corded autonomic measures under a variety of non- 
dy extended and elaborated previous work in 


the following ways: (4) Since neurotic patients consistently exhibit more 
SNS activity in a number of measures than normal individuals, ап attempt 
was made io determine whether normal individuals can likewise be differenti- 
ated in terms of their SNS responses to stress. (b) The causes of any signifi- 
cant differences in SNS activity between sympathetic and parasympathetic $$ 


and neurotics, 
number of concurrently-re 
thermal stresses. The present stu 
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during stress were investigated. Such differences may appear either because 
sympathetic Ss characteristically display more SNS activity than parasympa- 
thetic Ss under nonstress, as well as stress conditions (i.e., they are basically 
“biased” toward greater SNS activity under any circumstance), or because 
sympathetic Ss show greater increases in SNS activity when confronted with 
a stress. Efforts were made to determine whether either or both of these fac- 
tors were operative. (c) Cold and heat exposures were included among the 
four stress conditions to determine if S differences recorded during psycho- 
logical stresses are maintained during thermal stresses. As noted by Ruch, 
Patton, Woodbury, and Towe (10), thermoregulatory activity, like response 
to psychological stress, is mediated by the ANS; therefore, if an $ is hyper- 
reactive to psychological stress, he may likewise be highly responsive to ther- 
mal stresses. 


B. METHOD 
1. Subjects 


Twenty-four U.S. Army enlisted men between the ages of 19 and 26 served 
as Ss in this experiment. Prior to the study, autonomic balance scores (12) 
were recorded for each S. Based on these results, Ss were divided into two 
equal groups consisting of (a) the 12 Ss with the lowest (most sympathetic) 
and (5) the 12 Ss with the highest (most parasympathetic) scores. The mean 
and standard deviation for Ss placed in the sympathetic category were 68.66 
and 2.58, respectively; the corresponding values for Ss assigned to the para- 
sympathetic group were 78.34 and 5.39. After corrections for heterogeneity of 
variance were made (1), aż test indicated that these two groups differed sig- 
nificantly in autonomic balance (2 = 5.61, p < .001). 


2. Experimental Conditions 


Each 8 was individually subjected to the following four stress conditions: 
(а) S was exposed to a temperature of 50+1°F. with 50 percent К.Н. and 
a wind speed of less than two mph. for 60 minutes. (5) S was exposed to a 
temperature of 110+1°F. with 20 percent R.H. and a wind speed less than 
two mph. for 60 minutes. (c) $ was shown two technicolor movies of surget 
entitled "Debridement of Wounds of the Extremities” and “Management 
and Technic of Pulmonary Lobectomy." (d) S was asked to solve 35 five- 
and six-letter anagrams. The 80-second time limit for solution of each апат 
gram was short enough that every S failed to solve at least one-half of the 
anagrams. А fifth session in which S watched a comedy movie provided non" 
stress comparison values for later statistical analyses. 
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3. Experimental Design 


А ame 8 underwent one experimental condition per day for five consecutive 
ays, each session lasting 60 minutes. Conditions were administered on dif- 
ferent days to prevent the artifactual appearance of autonomic response con- 
stancy based on close proximity of conditions in time, a problem noted by 
Wenger, Clemens, Coleman, Cullen, and Engel (13). Likewise, the testing 
time for each S was randomly varied from day to day to eliminate consistency 
in autonomic response based on similarities in diurnal cycle. 

All five conditions could not be counterbalanced because of difficulty in 
scheduling climatic chambers for heat and cold exposures at appropriate times. 


However, a partially counterbalanced schedule was employed whereby all Ss 
were given the heat exposure on the first day of the experiment, the cold ex- 
hree intervening days, the three 


posure on the fifth or last day, and on the t 
remaining conditions were counterbalanced among Ss for order and sequence. 


On those days when the $ was not heat- or cold-exposed, room temperature 
was maintained at 80-=2°Е. with 50 percent R.H. and less than two mph. 


wind speed. 
4. Procedure 


andial effects, Ss were not allowed to eat in the two 
iving at the laboratory, 5, who 


To eliminate post-pr 
y, upon arri 
ireclining position in a chair 


hours preceding testing. Each da 
was dressed only in shorts, Was placed in a semi i 
and then outfitted with equipment to record pulse rate, systolic blood pres- 


sure, palmar skin conductance, and instep temperature. The first three mea- 
n a Grass Model 7 polygraph, while the fourth was 


read by a Yellow Springs Model 47 tele-thermometer. 
a Grass PTTI photoelectric pulse transducer was 


To record pulse rate 
placed onthe pem БИЕ? The transduced signal was then 
fed back to a Grass Model preamplifier on the polygraph. 
A 30-second sample of each minut te recording formed the basis 
for subsequent data reduction and analysis. & M 

A. combination of a Grass Model 7P8B sphygmomanometer and E & |! 
automatic cuff pump was used to measure systolic blood pressure at two-min- 
ute intervals. The microphone from the sphygmomanometer preamplifier was 
placed over the brachial artery at the antecubital fossa of the S’s left arm to 
record the Korotkov sounds. 3 

Palmar skin conductance was measured by passing a 50 pamp. current be- 
tween two .80-inch diameter zinc chamber electrodes, which were filled with 
zinc sulfate electrode paste (9) and placed on the left palm and dorsal sur- 


sures were recorded o 


of the S’s left hand. 
7P4A tachograph 
e's pulse ra 
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face of the left wrist. This signal, originally recorded in ohms, was then. fed 
to a Grass Model 7P1A low-level DC preamplifier on the polygraph. The 
final record was sampled every 60 seconds, and the recorded values were con- 
verted from ohms resistance to square root of the conductance (14) for sub- 
sequent data analyses. | 

Finally, instep temperature was monitored for 20 seconds every four min- 
utes by a combination of a Yellow Springs Model 421 surface temperature 
thermistor, which was placed on the instep of the S's right foot, and а Yel- 
low Springs Model 47 tele-thermometer. 


C. RESULTS 


1. Individual Measures During the Nonstress Condition 


To determine if sympathetic Ss displayed significantly more SNS activity 
an parasympathetic Ss during nonstress conditions, the mean absolute values 
of pulse rate, systolic blood pressure, palmar conductance, and instep tempera- 
ture recorded from these two groups during the comedy condition were con- 
trasted by ¢ tests. As shown in Table 1, sympathetic Ss displayed more SNS 


activity in three of four measures ; however, the differences between the groups 
were significant only for instep temperature. 


th 


2. Composite Index of SNS Activity During the Nonstress Condition 


To gain a more general estimate of SNS 


dition, for each of the 24 Ss, the mean abs 
transformed into a Т score 


activity during the nonstress con- 
olute value for each measure was 
to produce a distribution with a mean of 50, а 
standard deviation of 10, and a sample size of 24. The four T scores, each 
corresponding to one of the four autonomic measures taken from an § during 
one stress condition, were then added together, Finally, the mean composite 
score for the 12 sympathetic Ss was compared with that for the 12 parasym- 
pathetic Ss by means of a № test. As shown in Table 1, the composite index 


of SNS activity during the nonstress condition was significantly greater for 
sympathetic Ss than for parasympathetic Ss, 


3. Change in Individual Measures from Nonstress to Stress Conditions 


The next analysis was designed to determine (a) if Ss display consistent 


i with respect to their fellows under varying stress 
if sympathetic Ss display greater increases in SNS ac- 
tivity than parasympathetic Ss, With the use of values recorded during the 
comedy condition as nonstress baseline figures, 


increases in SNS activity 
conditions and (2) 


a measure of change, the au- 
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tonomic lability score (6), was calculated for each of the S's four autonomic 
measures during each of the four stress conditions. 

To determine if Ss display characteristic changes in a particular measure 
under varying stress conditions, the autonomic lability scores for the 24 Ss 
were ranked from 1 to 24 during each of the four stress conditions. The sim- 
ilarity of the rankings from one stress condition to the next was then tested 
by means of a Kendall coefficient of concordance, the preceding statistical an- 
alysis being completed once for each of the four autonomic measures. In all 
four cases, a significant tendency for interindividual changes in autonomic ac- 
tivity to remain stable from one stress to the next was noted (FF systolic blood 
pressure = 54, p < .001; W suse rate = .48, p < .01; I instep temperature = AT, 
p< 01; W pamar conductance = .41, $< .05). 

The preceding analyses illustrated consistency of response change; however, 
they did not indicate whether sympathetic or parasympathetic Ss displayed 
greater increases in SNS activity. To answer the latter question, the auto- 
nomic lability scores for each of the four autonomic measures were subjected 
to analysis of variance (8). As shown in Table 2, with two exceptions, the 
sympathetic 8s were more reactive; however, only in the case of square root 
of palmar skin conductance was a significant difference, F (1, 22) = 9.12, 
p < .01, noted between groups. All four $ category X, condition interactions 


were nonsignificant, indicating that group differences were relatively consis- 
tent from one condition to the next. 


4. Composite Index of Change in SNS Activity 
from Nonstress to Stress Conditions 


In an analysis similar to one completed previously on the nonstress record- 
ings, S’s autonomic lability scores for all four physiological measures were 
added together to gain a more general index of change in autonomic activity 
during each stress condition, The summed scores attained by the 24 Ss were 
ranked from 1 to 24 during each stress condition, and the similarity of the 
rankings from one condition to the next was tested by the Kendall coefficient 
of concordance. The value thus produced was significant (W = .55, Р < 
.001), indicating that § rankings in terms of overall autonomic change during 
stress remain fairly stable under varying stress conditions. 

However, as shown in the bottom row of Table 2, differences between 
sympathetic and parasympathetic Ss in this measure did not reach significance 


when tested by analysis of variance. Again, the 8 category X condition intet- 
action was nonsignificant. 
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В " itions 
5. Absolute Levels of Individual Measures During the Stress Condition 


Stress effects can be gauged not only by noting the amount of change from 
nonstress to stress conditions, but also by measuring the magnitude of "m 
sulting absolute levels of SNS activity. The following analyses were inten ея 
to indicate (a) if S rankings in terms of absolute autonomic activity remal : 
stable under varying stress conditions, and (b) if sympathetic Ss display con 
sistently higher levels of SNS activity than parasympathetic Ss. à 

As in preceding analyses, the Kendall coefficient of concordance was vin 
to test the stability of S rankings of a particular measure under the four е: 
conditions. For all four measures, a significant tendency for interindividua 
differences in absolute autonomic activity to remain stable from one stress to 
the next was noted UP „уккоце nood pressure = .59, b < 001; I pise rate = 57, 
$ x 01; JP пеар conasetancs = A7, b < 015 Иер temperature == 446} < 
01). | | 

The consistency and extent of mean differences in the four autonomic mea- 
sures between sympathetic and parasympathetic Ss under the four stress con- 
ditions were tested by analysis of variance. The means in Table 1 show that, 
with the exception of instep temperature in the heat, sympathetic Ss displayed 


эш” à ‘i ; sure un- 
more SNS activity than parasympathetic Ss in every autonomic measure 


der every stress condition. The overall difference between the sympathetic and 
parasympathetic groups was significant for pulse rate, F(1, 22) = 4.30, Р < 
:05), and square root of palmar skin conductance, F(1, 22) = 7.83, р < 
.025, but not for systolic blood pressure or instep temperature. In three of the 
four analyses, palmar skin conductance being the sole exception, nonsignificant 
$ category X condition interactions were found indicating that, in these three 


ә р : 4 m- 
cases, the magnitude of the differences between the sympathetic and parasv 
pathetic groups remained relatively 


consistent from one stress to the next. 
6. Composite Index of Absolute Levels of SNS 
Activity During the Stress Conditions 


Composite indices of the absolute level of SNS activity displayed by each 


$ during each of the four stress conditions were computed as previously de- 
scribed ; that is, mean absolute values of each of the four 
recorded from an § during one session w 


four Т scores were added together, The S's composite T scores were ranked 
З НА х : р ie а е 
during each condition, and, as in preceding sections, the similarity of th 


rankings from one stress condition to the next was tested by means of a at 
dall coefficient of concordance, The resulting W was significant (W = -6% 


i 5 

autonomic pug 
e 

ere converted to T scores, and t 


— 
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p < .001), indicating that, relative to others, the total amount of SNS ac- 
tivity exhibited by an $ tended to remain stable over stresses. 

'To determine if sympathetic Ss consistently displayed more composite SNS 
activity than parasympathetic Ss, the mean summated T scores for these two 
groups were compared over the four stress conditions by analysis of variance. 
As shown in the bottom row of Table 1, the sympathetic Ss displayed more 
sympathetic activity, as evidenced by their significantly larger, F (1, 22) = 
7.52, p < .025, overall scores. Again, in this analysis, the 8 category XX con- 
dition interaction was nonsignificant, indicating that differences between the 


groups were fairly uniform from one stress condition to the next. 


D. Discussion 
The preceding analyses indicate that, relative to other Ss, Ss respond con- 
sistently under varying stress conditions, both in terms of change in SNS ac- 
tivity from nonstress to stress conditions and in terms of absolute level of 
SNS activity displayed during stress. For example, an S who has both an 
above average increase in pulse and an above average pulse rate during one 


Stress can be expected to behave similarly during other stress conditions. As 
noted by Sternbach (11), different stresses elicit different patterns of auto- 
'Thus, one stress may produce a higher mean pulse rate than 
1 S's pulse may rise and fall with the general 


hether this S’s pulse rate is high or low in an 
will always be above average in 


nomic response. 
another, and the hypothetica 
mean, However, regardless of w | 
absolute sense, during апу given stress, it 
comparison with other Ss’ pulse rates. 
Second, the results show that Ss cl 


more SNS activity during various stresse: MC cma gne rs | 
оп an individual and а composite measure basis. hough the data are no 


conclusive, they tend to suggest that differences in SNS activity between sym- 
" » 


Pathetic and parasympathetic Ss during stress exist ей because = sympa- 
thetic Ss characteristically display higher levels of SNS activity during non- 


i ied) and 
stress conditions (i.e. they have а hen the stress is applied) 


head start w! 
b) зу i re reactive to stresses. | | 
(5) sympathetic Ss аге mo me MEN 


Indiv; d in lea (i.e., previous experiences wt і ! 
tude Pater кее penne might logically be expected to ш : 
tutional differences in autonomic function among Ss, For a M м ; 
classed as sympathetic who has worked in a hospital might ме nr йш 
Surgery less stressful and, as а result, display less sympathetic н ea ir 
the film than an S categorized as parasympathetic who has agate T 
tration, Moreover, changes from "psychological" stress (the à 


assed as sympathetic consistently exhibit 
s than do parasympathetic Ss, both 
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films of surgery) to thermal stress (cold and heat exposures) might be ex- 
pected to destroy the consistency of intersubject differences. Nevertheless, in 
this experiment, changes in the identity of the stress were not of sufficient 
importance to obliterate the basic differences in autonomic activity among Ss, 
as witnessed by the consistent rankings of Ss' autonomic measures from one 
stress condition to the next. Additional support for this statement can be 
found in Table 1 where four out of five relevant S category X condition in- 
teractions were found to be nonsignificant, indicating that, on the whole, re- 
sponse differences between sympathetic and parasympathetic Ss remained fairly 
stable regardless of the condition being experienced. 


П 


Е. SUMMARY 


The purpose of this study was to determine whether there are consistent 
differences among Ss in the character of their autonomic nervous system 
(ANS) response to varied thermal and psychological stresses. The prediction 
was made that, regardless of the exact ANS response pattern elicited by the 
stress, (a) Ss would respond consistently ; that is, an 5 would exhibit a char- 
acteristic amount of sympathetic nervous system (SNS) activity in a partic- 
ular measure under a variety of stressors and (^) individuals classed as sym- 
pathetic under rest conditions would display relatively more SNS activity 
during each stress than Ss characterized as parasympathetic. 

Twenty-four adult, male Ss were classified as sympathetic or parasympa- 

+ thetic according to their autonomic balance scores and were then exposed to 
one nonstress (comedy movie) and four independent stress conditions (cold, 
heat, anagrams, and films of surgery) while recordings were taken of pulse 
rate, systolic blood Pressure, palmar skin conduct: 
Greater SNS activity was indicated by increasi 
pressures, palmar skin conductances, a 

S rankings of (а) change in SNS 
tions and (5) absolute level of SNS 


sistent from one stress to the next 


ance, and instep temperature. 
ng pulse rates, systolic blood 
nd decreasing instep skin temperatures. 
activity from nonstress to stress condi- 
activity during stress were relatively con- 


: regardless of the particular autonomic 
measure involved. Moreover, Ss classed as sympathetic had consistently higher 


pulse rates and palmar skin conductances during all four stressors than those 
Ss categorized as parasympathetic, Finally, when all four measures were com- 
bined into a more general index of SNS activity, the sympathetic Ss had con” 
sistently higher combined scores, Differences in SNS activity between sympa 
thetic and parasympathetic Ss were tentatively attributed to (a) constitutional 


differences in autonomic activity and (4) the greater reactivity of sympathet!© 
Ss during stress. 
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MAGNITUDE SCALING OF FOODS** 
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Тоштн W. ONLEY, Nina SULLIVAN, AND Susan Н. Boncyck 


———À 


What do we mean when we call something "rich" or “fattening”? The 
ate the extent to which an individual can use 


present study sought to evalu 
the concepts “richness” and “fattening” in a quantitative manner. The prob- 


lem was to gauge the extent to which various foods are perceived as “rich” 
or “fattening.” The method of magnitude estimation, common to sensory psy- 
chophysics, though not widely applied to semantics, was used (6). 

Questions of food preference and food scaling have concerned both food 
technologists and sensory psychologists in recent years. Harper gives references 
to the history of work on food preferences (3, 4) ; representative scaling stud- 
ies include those by Bruvold and Gaffey (1) and Gregson and Russell (2). 
'There has also been considerable practical interest in the derivation of ade- 
tors of food consumption and preference [cf.: Pilgrim and Kamen 
arch has applicability in many of these areas, but was 
s attention on the usefulness of а relatively new 
blem seemingly out of the realm of psy- 
as applied to food evaluation. 


quate predic 
(3)]. The present rese 
designed primarily to focu 
psychophysical methodology for a pro 


chophysics—the evaluation of meaning, 
were conducted ; foods selected for investigation 


Several group experiments 

were selected from a variety of menus published in standard cookbooks. "They 
are listed in Table 1, in one of the random orders in which they were pre- 
sented, Subjects varied in age from 19 to 45, and included approximately equal 
sex distributions. Anyone who indicated prior knowledge of "Nutrition," or 
who had been in a special dietary regimen of any sort was excluded from par- 
ticipation. In each portion of the experiment, groups chosen at random within 

pewritten sheet (see appendix). 


an Introductory Psychology class received а typew! \ r 
on which food names were listed, with specific instructions concerning the 


be made, and the standard to be used. Instructions were similar 
ad “fattening,” and for the standards of "lamb chop,” 
k.” Instructions followed the paradigm: 


judgment to 


for both “richness” 


in > : 
'ice cream," and “homogenized mil 


sal Office, Provincetown, Massachusetts, on April 26, 1969, 
Editorial w Street, Worcester, Massachusetts. Copyright by 


* Received in the 
eceived in rete at d$ Ne 


and published imme 
The Journal Press- 
А 1 This research W: 
Science Foundation for U 


e assistance of a grant from the National 


cted with th Ae pit 
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TABLE 1 
PEARSON r CORRELATION COEFFICIENTS 
Homogenized 
Standards Lamb chop Ice cream milk 
Richness ws. Calories .52* .52* 72" 
Fattening vs. Calories 6299 E nl T 
Richness Фз. Fattening .60** .g6** 92 __ 


s Significant at the .05 level. 
** Significant at the .01 level. 


Foods are often described as varying in their richness. Some foods are 
much richer than others, some not rich at all. Taking as a reference 
point, that we will call the richness of homogenized milk “100,” indicate 
for each of the foods listed, how rich it is relative to this standard. If 
it seems half as rich, call it “50.” If it seems three times as rich, call it 
300... and so forth. Assign each food on your list a number which in- 
dicates how rich you think it is, relative to the standard. 


Whatever the standard, it was assigned the number “100.” Separate groups 
were used in judgments of richness and of fattening, for each of three stan- 
dards. Thus, the experiments involved 2 categories X 3 standards X N = 
19, or a total of 114 subjects. A list of the specific foods judged is appended. 

The basic data of these experiments are series of numerical estimates of the 
magnitude of “richness” and "fatteningness" of foods relative to the three 
standards. Initially, the data were analyzed in terms of major nutrient com- 
ponents (protein, carbohydrate, fat), as well as the obvious psychophysical 
correlate, caloric content. Nutrient classes provide less information than an- 


APPENDIX 


LIST or ITEMS JUDGED FOR “RICHNESS” or “FATTENING” 


Plain pizza 


Z Roast lamb Prune whi 
E shell Cheeseburger Plum pudding with sauce 
een 1 Eggplant Asparagus 
есап pie А Pie à la mode Sponge cake 
pue cheese with crackers Baked ham Lamb chop 
mes о нон Chocolate eclair Peanut butter sandwich 
Milkshake Calves liver Cornish game hen 
Моо Shrimp cocktail Crepes suzettes 
Gars edis Ice cream Lemon sherbet 
RUE dod Oysters on half shell Broiled swordfish 
Yon chs Chocolate layer cake Canned pears 
p. р Pureed spinach Roast beef 


Avocado pear 
Tuna fish with noodles 
Homogenized milk 


Soda crackers 
Sauerkraut 
Eggnog 
Sweet roll 


Fresh strawberries 
Hot chocolate 
Pancakes with syrup 


^ 


y 
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alysis based on caloric content, thence the latter analysis was pursued further. 


Table 1 summarizes the Pearson r correlation coefficients for this set of ex- 
standard, we find reasonable and significant correla- 


periments; whatever the 
“fattening” and caloric content, adjudged 


tion between either “richness” or 


300 
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FATTENING 


ESTIMATES 
o 


300 
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o 100 
CALORIES 
FIGURE 1 
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for our stimuli via a standard cookbook (7). Richness and fattening judg- 
ments are highly correlated, 


From the work of Pilgrim and Kamen (5), it is suggested that knowledge 


of food classes may Provide predictive information with regard to food selec- 
tion. Similarly, knowledge of food classes has been found to provide informa- 
tion which considerably improves the predictive value of magnitude functions 


400 


DESSERT 
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relating either "richness" or "fattening" to caloric value. Dairy products, 
meat products, and desserts have been selected as useful categories for this 
particular sample of foods: in Figures 1, 2, and 3 magnitude estimates are 
shown for these three classes of foods, and the homogenized milk standard 
(which showed the highest general correlation with caloric content). Note 
that when stimuli are separated into food classes, high correlations exist for 


FATTENING 


ESTIMATES 


300 400 500 600 700 


CALORIES 


FIGURE 3 
SIMILAR ESTIMATES FOR MEATS 


о 100 200 
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23 : че iu 

individual classes and that these, although all positive, differ signifoantly : 
f y F 

magnitude for the three classes. The most obvious “maverick is the ше 

- Sms ; 

meats; for these data, neither the term “rich” nor “fattening” is a good dis 


i j i an- 
criminator. Analogous results apply to judgments against the other two sta 
dards. 


TABLE 2 
“Power LAW" EXPONENTS 


Homogenized 
Standards Lamb chop Ice cream milk 
Food Types 
Milk Products 
Richness 1.54 1.59 115 
Fattening 145 1.36 1.14 
Meats 
Richness 0+ 15 27 
Fattening 404 .20 21 
Desserts 
Richness 96 .80 92 
Fattening .89 77 95 


Data of magnitude estimatio: 
sensory dimensions, are widely inter 
[note for example, 1, 2]. In the pow 
log-log form, we find the exponents 
dividual food types are highly similar 


n studies in the realm of taste, as well as other 


Preted in terms of a power law form 
*r law context, with the present data in 
indicated in "Table 2, Exponents for in- 


for all three standards, but differ widely 
across food types, Again, there is evidence that caloric content is not a pres 


dictor of "rich" or “fattening,” for the class labelled “meat products.” 
Is there any meaningful rel 


what one calls “rich” and what 
is construed to be “$ ing” 


» as applied to foods, seems 


to lack “meaning,” when applied to certain classes of foods; as applied to 


other classes, it has 4 definitiy 

While it is not clear that 
of foods or “fattening,” 
matched groups, a 


е meaning. 


Subjects base their judgment of the richness 


the two types of judgment, for 
orrelated. Foods that are judged 

attening, and the reverse. The psy- 
chophysical method of magnitude estimation Provides useful indices of mean- 
richness" and “fattening,” as applied to 


a 
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commonly experienced foods, evaluated according to caloric content. Food 


classes (in this case, meats, milk products, desserts) are useful predictors of 
the richness and fattening magnitudes; analysis of magnitude estimations for 
several standards yields food scaling functions which may be described by 
power law relations, but which are highly dependent upon food class. ў 
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THE COLOR-WORD INTERFERENCE TEST: THE EFFECTS 
OF VARIED COLOR-WORD COMBINATIONS UPON 
VERBAL RESPONSE LATENCY* 1 


Department of Psychology, Vassar College 


Joyce L. SICHEL AND KENNETH A. CHANDLER? 


A. THE PROBLEM 


vidual differences, much interest has been shown re- 
attention; that is, Ss performing under 
ting response tendencies are particu- 
have utilized the phenomenon of 


In the study of indi 
cently in tasks requiring differential 
conditions of distraction or when compe 


larly strong. Studies in this general area 
“color-word” interference for such assessment. When Ss are asked to name the 


color in which a color-word is printed, and that color is different from the 


meaning of the color-word itself (e£ the word RED printed in the color 
rable difficulty. Stroop used a serial format of 


green), they experience considera 
known as the Stroop Test (24). Since then, 


such stimuli which has become (1 | 
many investigators have employed this test, and variations of it, to study in- 
tion (11, 25), personality (14, 16), cognitive 


dividual differences in percep 
style (1, 2, 5, 9, 10, 20), development 
pathology (3, 19, 22, 23, 26). Jensen presents 
findings of such investigations (13). 

£ interference used by 
ence between the time taken to name t 
and the time taken to name the color of a similar number of incongruently 
colored color-words. Jensen reports that this may well be the most sensitive 
“interference effect” (13). Several investigators have used 


ch a time difference to reflect the interference phenomenon, 


(6, 17, 18), and drugs and psycho- 
a comprehensive review of the 


most investigators has been the differ- 


The measure 0 
he color of a number of color blots 


measure of the 
variations of su 


А ri i il 26, 1969 
* ived i itorial Office, Provincetown, Massachusetts, on April 26, " 
and a ач 35 New Street, Worcester, Massachusetts. Copyright by 
'The Journal Press. 


1 This study was 


i ted in part by resources from the National Science 
implement Grant GY-1841. Presented in abridged form 


Foundation Instructional Improvement е e 

4 :cal Association, Washington, D.C., April, 1968 (21). 
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but most of these have been no more than algebraic equivalents of this basic 
measure (12, 25). Almost all of this work, however, has been based on the 
implicit assumption that the time difference between naming a color blot and 
a color-word simply sums when numbers of such stimuli are employed. 

Few investigators have reported data concerned directly with the nature of 
the test stimuli themselves. Klein varied the level of meaning of Stroop-tyP* 
stimuli (15) and Jensen suggested that differences in the sequence of response 
(13) may contribute to total variance. Dalrymple and Budayr reported that 
certain serial lists of Stroop stimuli produced greater difficulty than others 
(7). Fraser, using the conventional Stroop format, did not find any differ- 
ences in the interference effect when using selected color-word pair combina- 
tions (8). The present authors view such conflicting findings as reflecting the 
confounding effects inherent in the conventional Stroop design. Thus it would 
appear that insufficient attention has been paid to the fact that the stimuli 
words appear in a context of similar stimuli where they may be perceived in 
groups, though by instructions they are to be named singly. The opinion €X- 
pressed here is that the effects of such word grouping on the color naming Te 
sponse may vary as a function of (a) the particular word combination, (0) 
е color and brightness of the stimuli, (c) the interaction of color and word 
meaning, and (4) variations in perceptual span. 

In order to examine more closely changes in verbal responding that might 
occur as a function of such variables, verbal response latencies were obtained 
to singly presented color and color-word stimuli. It was felt that such а mea- 
sure would be more sensitive and reliable than the usual total-time measure? 
employed by previous investigators and have the added advantage of provi à 
ing information concerning differences in color-naming difficulty for the dis 
crete 15 employed, Single presentations of such stimuli provided the means 
of eliminating context effects present in the conventional Stroop format ап 
of controlling perceptual span. 

The usual Stroop color-word card allows triads and tetrads of similar 
stimuli, as well as many other recurring types of stimulus combinations. 
was felt that this nonsystematic contextual bias confounds the actual inte? 
ference effect. In view of this, an important aspect of the present study was 
the examination of the effects of different types of combinations of adjacent 
color-words. Dalrymple and Budayr (7) felt that differences in color-wo* 

pairs would produce differences in interference effects that would be maxim? 
for that sequence of color-words where the suppression of the word reading | 
response tendency to the first word would increase its unavailability for En 
color-naming response to the second word: for example, the word GREEN» 


t 
#- 
| 
| 


— 
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(printed in the color blue) followed by the word RED, (printed in the color 
green). However, their method did not take into account the various factors 
suggested in the preceding paragraphs, and perhaps this accounts for some 
ambiguity in their findings. The present authors felt that a condition meeting 
their specifications but having the added requirement of suppressing the color- 
naming response one has just made to the first word, RED; (printed in the 
color green) paired with GREEN, (printed in the color red) for example, 
would produce even greater interference effects. Several differing types of 
word-pair combinations were evaluated in this study; the particular word-pair 
combinations used here were selected on the basis of what was felt to be dif- 
fering strengths of word-meaning and color-naming competition. 

Thus the present study utilizes verbal response latencies to singly presented 


(а) color blots, (^) congruently and incongruently colored color-words, (c) 
ns of color blots, and (d) paired combinations of congru- 


ly colored color-words. Examination of such factors was 
ning the optimal conditions for constructing a more 


-word interference test. 


paired combinatio 
ently and incongruent 
directed towards determi 
sensitive and reliable color 
B. APPARATUS AND PROCEDURE 


Ss were twenty-four female undergraduates at Vassar College. All were 
enrolled in the introductory course and served as Ss to satisfy a course re- 
quirement. They were screened to rule out visual deficiencies, language flu- 
ency differences аз à function of marked cultural variance, and medical con- 
ditions felt to be related to the task demand. 

Ss were seated in a moderately lighted room two feet from a Sawyer back- 
reen. Thirty-five mm color slides were presented. automatically 
jector Model 600) ; their appearance activated, by 


means of photo-cell and switching devices, а Standard Electric timer accurate 
to .01 second. Ё controlled the rate of presentation of the slides. 8s were fit- 
ted with a throat microphone in the area of the superior carotid triangle 
which, through а Grason-Stadler Voice Operated relay (model E7300-A), 
provided the means by which Ss verbal response would stop the timer. The 
time between stimulus onset and Ss’ response was taken as the measure of 


verbal response latency- Several pre-experimental trials were employed to fa- 
iniliarize S with the task and to adjust throat microphone and electronic 
arize а $ 
equipment for maximum sensitivity. 
Stimuli used in this experiment con 


following (Cf. Figure 1): (a) Colore 
Subscripts indicate color in which rectangles were 


projection sc’ 
on this screen (Sawyer Pro 


sisted of 48 35-mm color slides of the 


d rectangles: XXX,; ХХХЬ; XXX, 


printed, 3 slides. (2) Con- 
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gruently colored color-words: RED,; BLUE); and GREEN,. Subscripts in- 
dicate color in which word is printed, 3 slides. (c) Incongruently colored 
color-words: RED); GREEN,.,; and BLUE, g. Subscripts indicate color 
in which word is printed, 6 slides. (4) Pairs of colored rectangles: e.g. 
XXX-XXX,; XXXy-XXX,; etc. Subscripts indicate color in which rec- 
tangles are printed, 6 slides. (е) Pairs of congruently and incongruently col- 
ored color-words: 30 slides for the various combinations: e.g. Congruently 
colored pairs: RED,-BLUE,; GREEN,-RED,; etc. and Incongruently col- 


*Subscripts indic 
printed. 


ate color in which stimulus is 


FIGURE 1 


EXAMPLES OF Test STIMULI USED IN Coton NAMING EXPERIMENTS 


огей pairs: RED, GREEN, ; BLUE,-GREEN,,; etc. Subscripts indicate color 


x which word is printed, Examples of test stimuli are presented in Figure 1. 
Figure 2 presents the particular color-word pair combinations selected on the 
b 


asis of expected differences in competing response tendency strengths. 

Ss' task was to name the color of the stimulus presented. Instructions to 8 
were for single stimuli: You are to NAME THE COLOR of what you see 
on each slide as rapidly as you can, Some slides will have words on them, some 
just colored rectangles. IN ALT, CASES you are to respond with the NAME 
OF THE COLOR ; it will be red, blue or green. For paired stimuli: On 
some slides there will be more than one item. 


In these cases you are to name the color of the item on the left first, then 
the one on the right. 
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For the single stimuli experiment a Latin square design was employed so 
that the effects of sequence and order of presentation could be evaluated; an- 
alysis of variance did not reveal significant order or sequence cffects. Thus a 
3 x 3 factorial analysis of variance design was used for the single stimuli 
studies and a 5 X 6 factorial design for the paired stimuli studies. Each 8 was 
tested under all conditions, four measures for each of the forty-eight test 


stimuli. 
LEVELS OF INTERFERENCE 
Minimal Intermediate Maximal 
CONGRUENTLY COLORED |INCONGRUENTLY COLORED INCONGRUENTLY COLORED 
COLOR WORDS COLOR WORDS I COLOR WORDS IT 


RED, BLUE, 
БЕР, GREEN, 
BLUE, RED, 
GREEN, BLUE, 
4iGubscripts 


indicate color in which color-word is printed 


FIGURE 2 
-Worp Pairs VARYING IN ExPECTED LEVELS OF INTERFERENCE 


EXAMPLES OF COLOR 
C. RESULTS 

‘on that verbal response latencies would provide a highly reli- 

aming tasks is supported by the results revealed in 

bilities for the single stimuli was found to be 

+.69; both significant beyond the .01 level 


of confidence. Reliabilities between the various test conditions ranged from 


+.88 to +.96 revealing highly stable individual performance. Retest of 85 
ifferences in verbal response latencies to the 


did not produce any significant d i 
color naming tasks. It was hypothesized that verbal response latencies would 
differ as a function of the variables presented below. 

1. The Effects of Differently Colored Color-Blots 

and Different Color-IF ords Congruently Colored 


The expectati 
able measure for color n 
this study. ‘The odd-even relia 
4-.66 and for the paired stimuli 


That verbal response latencies do differ as a function of color for both 
or-words is seen in Table 1. 


color blots and different congruently colored col 
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It can be seen that verbal response latencies to RED, BLUE, and GREEN 
rectangles are significantly different as are those for the congruently colored 
color-words, RED,, BLUE,, and GREEN,. The hypothesis that such differ- 
ences would occur is thus supported. It is interesting to note that in this com- 
parison the red colored rectangle and the color-word RED printed in red 
both have the shortest response latencies whereas green in both instances has 
the longest latency. This is consistent with the findings of Chandler and Pal- 


TABLE 1 
VERBAL RESPONSE LATENCIES TO COLORED RECTANGLES AND 
f CONGRUENTLY CoLonED CoLor-Worps — 
P Zr Meanb 
Variable? (milliseconds) SE, N 
Colored rectangles XXX, 508 16 24 
XXX, 535 19 24 
XXX, 579 19 24 
Congruently RED, 594 16 24 
colored color-words BLUE, 643 18 24 
GREEN, 698 20 2 


а Subscripts indicate color in which stimulus is printed. 


„5 Differences between means within rectangles and means within color-words are 
Significant at or beyond the .01 level of confidence. 


ubinskas (4) who report that across three differing levels of judged bright- 
ness red colored rectangles were responded to more quickly than green colored 


rectangles; those colored blue appear to be intermediate between red and 
green. 


2. Congruently Colored vs. Incongruently Colored Color-W ords 


It is to be expected that incongruently colored color-words would reflect 
the interference effect through increased verbal latencies when compared to 
congruently colored color-words. That this was the case is shown in Table 2 
and presented graphically in Figure 3. Here we see that in every instance the 
incongruent coloring of a color-word resulted in an increased verbal latency- 
The shortest latencies Presented here occurred with those color-words colored 
red which again reflected the saliency of this particular hue. 


3. Singly Presented Color-Words vs. Color-W ord Pairs 


It was anticipated that congtuently colored and incongruently colored color- 
words would differ in terms of verbal latencies as a function of being PIC 


sented as paits (two color words) compared to single presentation (one color 
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word). The expectation was that the presence of the second color-word in the 
color-word pair, regardless of its color or meaning, would increase competing 
response tendencies with a resultant increase in verbal response latency. The 
latency for singly presented color-words, whether colored congruently or not, 
was found to be 733 milliseconds, while that for the color-word pairs was 941 
milliseconds. The hypothesis that competing response tendencies would be en- 
hanced by the presence of a second color-word and this reflected in an in- 


TABLE 2 
VERBAL RESPONSE LATENCIES FOR CONGRUENTLY AND INCONGRUENTLY 
Согокер COLOR-WORDS^ 


Color-Word¢ Mean (ms)? SExy 
Congruently colored 
RED, 594 16 
BLUE, 643 17 
GREEN, 698 20 
Incongruently colored 
RED, 810 33 
RED, 799 16 
BLUE, 71 23 
BLUE, 813 22 
GREEN, 723 26 
810 25 


GREEN, 


of incongruently colored color-words over congruently 


а Increase in verbal latency 
for RED, 18 percent for BLUE, and 20 percent 


colored color-words was 34 percent 


for GREEN. : 
b A difference of 44 ms is significant at or beyond the .05 level of confidence. 
hich the word is printed. 


© Subscripts indicate the color in w. 


creased verbal response latency is thus supported. The correlation between 
single color-word stimuli verbal response latencies and those obtained with 
paired combinations is -+-98 reflecting the high degree of consistency of indi- 


vidual performance. 


4. Different Types of Incongruently Colored Color-W ord Pairs 

The effects of differing types of color-word pair combinations used in this 
study (see Figure 2) were reflected in differing response latencies. These re- 
sults are presented in Figure 4. The type of color-word pair combination ex- 
pected to pose the most difficulty for Ss, and hence produce the longest verbal 
response latency, did in fact do so. This was the condition, indicated in this 
figure by Incongruently Colored II, of color-word pairs where the reading 


response tendency to the first word was the same as the color-naming response 
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to the second word and where the color-naming response to the first word 
had to be suppressed because it was the word reading response to the second 
word. 


90 
80 
Incongruently 
VERBAL Colored Color 
Words 
RESPONSE 70 
Т = 
LATENCY 
(ms.) 60 Congruently 
Colored Color 
Words 


RED BLUE GREEN 
Color 


Reone Ëi FIGURE 3 
SE LATENCIES FOR INCONGRUENTLY AND CONGRUENTLY CoLoreD CoLoR- WORDS 


D. Discussion 


It is clear, first of all, that verbal response latencies differed as a function 
of differences in color and brightness of the stimuli. The findings reported 
here for these variables are also consistent with those of Chandler and Palu- 
binskas (4). Both colored rectangles and color-words printed in colors Com 
gruent with their meanings showed shorter verbal response latencies when 
printed in red than for blue and green. 

When color-words were presented in colors incongruent with their mean- 
ings, verbal response latencies increased over those for congruently colore 
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color-words, thus demonstrating the basic color-word interference effect. The 
magnitude of this effect, however, was itself modified by the factors of hue 
and brightness and it is clear that this interference effect was not the same 
for all word by color combinations. А significant color by word interaction 


does occur. 


VERBAL 
RESPONSE 900 


LATENCY 
(ms.) 


Congruen 


tly Incongruently Incongruently 
Colored Colored I Colored II 


Word Pair Combinations 


FIGURE 4 
p LATENCIES FOR DIFFERENTLY 


RESPONS! 
VERBAL Pair COMBINATIONS 


COLORED CoLor-WORD 


d brightness, it was also found that the 
ficantly affected the interference phe- 
uced, here by pairing color-words, 
hat obtained for single color- 


actors of hue an 
text signi 
t was introd 
hown over t 


In addition to the f 
extent and type of word con 
nomenon. When word conte* 


an increased response latency was — Б 
words. It is apparent from these findings that the random selection of a spe- 


cific color-word or pair of words could inadvertently operate either to enhance 

or to disrupt the strength and consistency of the interference effect. The con- 

ventional Stroop ‘Test d fact utilize randomly selected incongruently 
B 


oes in 
colored color words masse n as many varied combinations as might 


d together i 


228 JOURNAL OF PSYCHOLOGY 


be expected from the randomization procedure (24). This confounds ы " 
cific relationships between color and word meaning with the effects of p : 
ular word-pair combinations. When the type of word-pair rit be 
brought under more adequate control, the level of interference is enhanc 
and its consistency increased, : | | М 
In order to examine such varying levels of interference, certain types o 
color-word combinations were selected. The work of Dalrymple and dee 
(7), as well as that of Fraser (8), suggested to the present authors that w n 
suppression of a Tesponse tendency is greatest, subsequent interference shou | 
also be correspondingly maximal. Dalrymple and Budayr felt that the — 
Suppression was that of the word reading response tendency to the first wori 
ts unavailability as the color-naming re- 
Sponse to the second word: e.g., GREEN,-RED,; on the other hand, Fraser 
ini suppression was that of the color- 


Correct word reading Tesponse to the second word: eg, КЕР,-СКЕЕМ№. 
While their theoretical Positions seem soun 


consistent, no doubt because they had little 
as expressed here, The present authors felt : 

cations should yield a condition of maximum difficulty in the color-naming 
task and therefore produce the greatest interference, If this were the case, 
then one could prepare a set of color-word stimuli in which maximum inier 
ference would occur and its effects not be confounded by other combinations 


that would tend to weaken it. Examples of such stimuli are presented in 
Figure 2 under the headin 


greater effect upon verb 
pair combination include 
both sorts of Suppressio 
thors to produce the hi 
ing the suppression me. 
the word-pair combina 


d, their results were not altogether 
control of contextual confounding 
that a combination of both specifi- 


ntioned by Dalrymple and Budayr. It was found that 


tions suggested by Dalrymple and Budayr did produce 
the second greatest color-naming difficulty. 


In considera 
bal response la 
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unit by itself with sufficient spacing between it and the next pair as the eye 
travels vertically down the stimulus plate. In this way confounding context 
effects are controlled, perceptual span held constant, and only those word-pair 
combinations are used that yield maximal interference. Five plates, each of 20 


PLATE III PLATE IV PLATE V 
BLUE,- BLUE -GREE] - 

д Н BLUE, RED, 
BLUE, P un, GREEN, -RED, BLUE-GREEN, 


PLATE I PLATE II 
BLUE.-GREEN, || GREEN,-BLUE, 


GREEN, -RED g BLUE,-RED, 
GREEN .-RED, GREEN,-BLUE, | | BLUE, * RED, REDy-BLUE, GREEN, -BLUE, 
RED, -GREEN - GREEN -BL GREEN, -BL a 
aca tms | mer || meum | eiu 
b Eb ас. г png 

CREEN -RED g GREEN,-BLUE, | | BLUE,-GREEN, || RED)-BLUB, GREEN, -REDg 
BLUE,-REDy BLUE-GREEN, | | BLUB,-RED, BLUE, -RED, BLUE, -RED, 
GRREN,-BLUB, || BLUE,-GREENp|| REDp-ORESNr GREEN,-BLUE, || BLUE,-GREEN, 
GRERN.-BLUE, || GREENS-REDg GREEN,-REDg BLUE, -REDy GREEN, -BLUE, 
RED -BLUE p BLUBg-GREEN | | GREEN -REDg САЕЕМЬ -ВШЈЕр || GREEN,-RED, 
REDy - GREENz GREEN,-RED, || BLUE,-RED, BLUB,-GREENp || GREENL-BLUE, 
GRREN,-BLUE, || BLUE RED} RED), -BLUE, RED,-GREEN,. BLUE-GREEN, 
BLUEg-GREEN, || REDy-BLUE, RED -GREEN, GREEN, -RED g RED, -GREEN,, 
BLUE, -RED, REDy -BLUB, BLUE, - GREEN, BLUE-GREEN), GREEN, -BLUE g 
REDy- BLUE, BLUEg-GREEN, | | GREEN,~BLUB, BLUE, -REDy RED, -BLUE,. 
BLUE-GREEN} RED -GREEN p GREEN, -BLUE g GREEN „-REDg BLUE, -REDy 
BLUE-GREEN} GREBNp-REDg GREEN, -RED g GREEN -BLUE g GREEN -RED g 

GREBN,-RED, || REDp-BLUE, GREEN, -REDg RED, -GREEN, 

GREENp-BLUEg | | CREBNp-BLUBg BLUE -RED, BLUE,-REDy 

RED -GREEN- BLUEQ-GREEN || ВЬШЕр-бИЕЕНЬ REDp-BLUBy 

*Subscripts indicate color in which color-word is printed. 
FIGURE PAIR COMBINATIONS 


SURE око 


LORED C! 
Hs UM INTERFERENCE EFFECTS 


INCONGRUENTL 
MAXIM 


WuicH PRODUCE 
revity of form, while also offering the means 


color-word pairs, provide for b 
for bject reliability. . 
awe eri s combined to yield a 


In its present use, total time for reading each plate i n 
total interference card time score which may be compared jos simi Е T 
time score for an identically arranged set of plates of s d n 
this way, the present authors have found that attention to the majo es 
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ing i iti color- 
has been effected, resulting in a more sensitive and stable measure of 
word interference. 


E. Summary AND CONCLUSIONS 


Single and paired Stroop-type color-word stimuli were presented in eel 
sion to Ss by means of color slides. Verbal reaction times for color naming є 
these stimuli were measured. The factors of hue, brightness, extent of Won 
context, and type of word-pair combination were studied with respect to Ber 
contribution to the basic color-word interference effect. It was found that e 
lected types of color-word pairs increased the magnitude and stability of the 
interference effect. A more refined and internally consistent color-word nam- 
ing test utilizing such pairs in a printed format is presented. 
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THE SELF-CONCEPT OF CREATIVE ADOLESCENTS* 1 


Department of Psychology, Fordham University 


CHARLES E. SCHAEFER 


A. INTRODUCTION 


In recent years the Gough Adjective Check List (ACL) has been employed 
With increasing frequency in psychological research. The main value of this 
Instrument for personality assessment is that it offers a unique picture of a 
Person’s self-concept by presenting an extensive checklist of adjectives com- 
monly used in everyday life: for example, spunky, good-looking, smug. The 
Motivational significance of the self-concept in determining a person's behav- 


lor has been pointed out by Rank (16). 
ailable scales for the ACL seem particularly rel- 


Many of the currently av 
evant for research on the creative person. Indeed, studies from the Institute 
of Personality "Assessment and Research (12) have indicated that creative 
adults tend to score high on such ACL scales as Autonomy and Aggression, 
les as Abasement, Affiliation, 


P a scoring comparatively low on such sca 
Deference. In terms of individual adjectives, MacKinnon (11) reports that 
e architects checked the following 


80 Percent or more of a group of creativ 

adjectives as self-descriptive: imaginative (98 percent) ; active (92 percent) ; 
honest, idealistic, and inventive (90 percent). MacKinnon and his associates 
further report that staff observations verified the accuracy of the ACL self- 


descriptions, 
The purpose of the present study was to extend creativity research with 
h school students. Several 


the ACL to a younger population: namely, hig! students. gev! 
authors (20, 21, 22) have pointed to the scarcity of creativity investigations 
D a i: 


With this age group, Yet, 188 Hammer observed, "Clinical experience suggests 
а 5 › a 5 

that Psychological techniques administered to adolescent subjects are fre- 

Wently more revealing of the underlying traits and feelings than are tests 


admin}. 
ministered to adults” (8, Р. 327)- 
жаш, 
i 15, 1969, 
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y The Jo 
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В. Метнор 
1. Subjects 
The subjects were 800 male and female students from 10 high schools in 
the New York metropolitan area.? The schools were chosen because they of- 
fer courses or programs providing opportunities for creative achievement, 
such as creative writing or science projects. "The subjects included 434 Ser 
niors, 272 juniors, and 94 sophomores. The group as a whole was superior 
in terms of academic achievement and intelligence test scores. 


2. Creativity Criteria 

The subjects were assigned to eight criterion groups of 100 students each, 
according to the following threefold system of classification: achievement 
(creative or control), field of achievement, and sex. For the boys, the achieve- 
ment fields were broadly classified as either art-writing or scientific. The art 
writing field for the boys included all forms of visual arts and all types of 
literary expression, while the scientific field encompassed the natural sciences 
and mathematics. Since few girls were found who produced a scientific work; 
the girls’ specialty fields were classified separately as either art or writing: 
The eight criterion groups were designated: Creative-Art-Writing-Boys 
(CrAW-B), Control-Art-W riting-Boys (CoAW-B), Creative-Scientific~ 
Boys (Cr8-B), Control-Scientific-Boys (CoS-B), Creative-Art-Girls (СгА- 
G), Control-Art-Girls (CoA-G), Creative-Writing-Girls (CrW-G), Сот" 
trol-Writing-Girls (CoW-G). 

The criterion employed for assessing creativity was a combination of teach- 
ers evaluations and creativity test scores. Accordingly, the creative Ss were 
nominated by their teachers as having produced one or more creative prod- 
ucts, such as drawings, poems, stories, science projects. "They also scored pen 


. ^ e 
a cutoff point on Guilford's Alternate Uses and Consequences tests. Th 
control Ss were nominated by the s o concrete 


À hon ame teachers as having shown n 
evidence of creativity, 


“ford 
н i and they scored below a cutoff point on the Guilfor? 
screening tests. Within each of the four specialty fields, the creative and co" 


trol $$ were matched in school attended, class in which enrolled, grade level 
and grade-point average, 


3. Procedure 
The ACL was administered as part of a larger test battery during # st 
hour testing session held outside of school hours, The Ss were paid for P^" 
2 The participating high schools w 


ville, Erasmus Hall, Forest Hills, 
Stuyvesant. 


" x 
ere Abraham Lincoln, Art and Design, Bug 


Jamaica, Midwood, Music and Art, Regis 
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ticipation in this study. Identification numbers were employed during the 
testing session to provide anonymity, and the students were assured of the 
confidentiality of their responses. The ACL was scored for each of the 24 


currently available experimental scales. 


TABLE 1 
EVIATIONS OF Boys’ CRITERION Groups oN ADJECTIVE CHECK LisT 


MEANS AND STANDARD D 


Criterion groups 


. Art-Writing Scientific 
Adjective Check Creative Control Creative Control 
List scales Mean Mean SD Mean SD Mean SD 


Creative groups hypothesized higher 


No. Unfavorable 
j 54.27 11.33 55.24 11.58 53.04 10.91 


Adj. $548 9.57 
Self-confidence 49.45 10.00 47.24 9.88 $141 9.17 46.30 9.92 
Lability 57.51 8.96 $0.48 10.00 55.57 12.10 50.73 841 
Dominance 50,37 1142 49.14 10.53 52.23 10.25 49.22 10.14 
Exhibition $5.64 9.63 51,29 1142 $5.39 10.24 50.66 11.07 
Ашопоту 3680 947 4932 990 $498 971 4916 8.60 
Aggression 56.29 9.55 5234 10.14 5708 1115 $247 9.64 
Change $5.59 10.21 47.74 10.06 51.19 10.38 48.31 9.67 
Control groups hypothesized higher 

т 4678. #77 4732 8.84 


Defensi 3 8.92 47.37 
CORR he 9.82 45.72 10.17 4340 11.09 4612 9.99 
4942 1177 4915 9.67 


Self-control 40.62 $16 977 
5 1.21 49. b H 
at т 42.56 11.67 47.27 10.05 


пе c 
urtur. 45.13 10.69 47.78 10.0 

Suo 445 1000 soos 1077 О 5110 10.02 
Deference 42:64 1027 4896 11.09 4255 1019 47.93 9.58 


No specific hypotheses 5j 172208556 


Number checked 48.24 8.09 4712 8.98 48.65 


44.52 10.65 44.39 9.97 


No. Favorab| 

Adj. d 4190 932 4505 935 
Personal 

л aas эм 08 9% 4313 10.8 4526 911 
Achievement 4929 11.33 4937 9.69 52.71 us 50.69 945 
Intraception 49.66 1032 4879 10.65 pon E 48.10 p 
Endurance 4652 1092 5071 940 . 308 rl а з 
шне ы on ue — 1038 4599 1025 4559 927 
Affiliation 4 эң 4699 M А ы 
Abasement 4829 1046 50.62 oat. 4146 9,7 SUSON 
ounseli 

Readinéts pas mozi | 1208782536 4909 989 51.66 10.83 


C. RESULTS 
ndard deviations of the criterion gro 
. On the basis of a review 


tive students would score si 
s: Number of unfavorabl 


ups on the ACL 
of the literature, 
ignificantly higher 
e adjectives, 


The means and sta 
are presented in Tables 1 and 2 
it was hypothesized that the crea! 
than the controls on the following scale 
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Self-confidence, Lability, Dominance, Exhibition, Autonomy, уке, ie 
Change. It was further hypothesized that the control $$ would score ү. 
on the scales of Defensiveness, Self-control, Order, Nurturance, Succo p 
and Deference. The ACL group comparisons found in Table 3 indicate tha 


TABLE 2 - List 
{ ADJECTIVE CHECK 
MEANS AND STANDARD DEVIATIONS OF GIRLS’ CRITERION GROUPS ON ADJECTIVE Cu 


Criterion groups 


Art Writing А 
Adjective Check Creative Control Creative wem ED 
List scales Mean SD Mean SD Mean SD 


Creative groups hypothesized higher 
No. Unfavorable 


0.48 
Adj. 5768 13.04 S471 1190 — 5675 1504 — 5192 100 
Self-confidence 48.35 $91 46.58 9.63 50.73 10.11 48.74 10.06 
Lability 54.03 10.07 51.29 9,25 5546 10.71 48.03 9.79 
Dominance 45.88 10.20 45.27 10.85 47.25 10.59 0.30 9.84 
Exhibition 5247 10.99 50.99 10.15 5441 10.62 Pond 8.64 
Autonomy 54.16 10.50 52.62 10.53 5710 10.74 T 9.52 
Aggression 55.58 10.59 53.94 10.80 57.80 10.25 PM 10.24 
Change 5418 10.13 5121 10.05 58.05 11.04 50. 
Control groups hypothesized higher 9.75 
Defensiveness 40.70 1211 43.30 1144 3948 1245 an 945 
Self-control 43.61 10.31 45.73 9.70 42.02 11.21 de 9.25 
Order 44.16 11.04 45.76 8.54 42.200 11.60 VS 10.27 
Nurturance 42.72 12.60 446% 11.77 42.8 12.52 505 8.82 
Succorance 3346 10.54 — 5579 1035 5443 1113 5320 Өл 
Deference 44.24 1042 46.33 9.87 41.37 10.37 48. 
No specific hypotheses 4 9.08 
Number checked 5548 10.50 50.96 9.65 56.50 12.27 50.5 
No. Favorable 87 10.24 
j. 4717 11.20 47.07 1211 46.39 11.26 43. 
Personal 26 970 
Adjustment 4238 10.5 43.92 1136 4182 1026 17 
Achievement 45.77 9.53 45.35 9.92 47.23 10.25 
Intraception 4647 9.38 47.26 10.95 47.72 840 
Endurance 44.51 10.64 45.96 8.78 43.53 11.35 
Heterosexuality 49.03 11.89 48.28 10.99 47.80 11.61 
Affiliation 40.79 11.70 41.69 11.14 40.88 12.24 
Abasement 51.39 10.93 52.56 11.01 50.51 10.20 
Counseling 
Readiness 55.32 10.96 5431 1119 57.98 10.10 


the mean differences on the above 14 scales were almost invariably in the is 
pothesized direction. "The direction of mean differences for the ten remainin 
ACL scales was not consistent. -— 
In view of the observation by Gough and Heilbrun that “in many 5' o 
tions the greatest value of the ACL may accrue from noting, perc Ў 
analyzing those specific words which an individual or group of indivi 
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э (7, p. +), an item analysis was conducted to 


А has checked as self-descriptive’ 
iated the creative and control 


identify the individual adjectives that different 


Ss in each specialty field. Following the item analysis procedure outlined in 
p Schaefer and Anastasi (19), a compound probability value was determined 
| TABLE 3 

Normat Curve Ratios* oF MEAN CRITERION GROUP DIFFERENCES 
ох ADJECTIVE CHECK List 
Criterion groups” 

CrAW-B CrS-B CrA-G CrW-G 

qs. vs. vs. vs. 
Scales CoAW-B CoS-B CoA-G CoW-G 

Creative groups hypothesized higher 
No. Unf. bl 

Adj. cna 82 1.38 1.68* 2.89** 

m Self-confidence 1.57 3.78*** 1.35 1.98* 
Lability 5.24*** 3.29*** 2.00* 4.57888 
Dominance 79 2.09* Al E 
Exhibition 2.92** age" 99 2.84** 
Autonomy 5.09*** 448*** 1.03 5.80*** 
Aggression 2,84% 3.14*** 1.08 5.8g*** 
Change 5495** 2.03* 2.08* 5,42% 

Control groups hypothesized higher* 

Defensiveness —2.26* E —1.56 =e 
Self-control —3.62*** —1.82* —1.50 —5.28 
Order —3.26**° ET. —1.15 —4.99*** 
Nurturance —1.80* 3.06** = E 
—1.4 — 94 — . 
Ва: Se 3.84*** —145 —s3iee 
No specific hypotheses 
Number checked .92 1.13 т” 3.9 wee 
e | men ur" 09 06 —2.23* 
Personal aj Lo ggees 
Adjustment —245* = К, ме: 

f Achievement — .05 К —1.05 —443*** 
Endurance gres T ao = 6s L188 
Intraception .59 120 cus BG —2.62** 
Affiliation —1.33 176 46 —133 
Heterosexuality — .88 = `$0* — 75 —135 
Abasement —1.65 — 

ounseling 64 3.29** 
Readiness — .98 175 
for control group. 
© E -Art- Writing-Boys, 


? Negative si indicate higher mean scor! то 
MET no o d а ES 
CrS-B = Creative-Scientife T i Are Girl CrW-G = Creative-Writing-Girls, 
CoW-G — Control-Writing-Girls. — 4 
* One-tailed tests of significance used. 
*«.0. 
at b S01. 
же р <.001. 
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for each of the 300 adjectives of the ACL. Those adjectives which signifi- 
cantly differentiated the creative and control groups at the compound prob- 
ability level of .10 or better in one or more of the four specialty fields were 
identified. 

The results of this item analysis can be summarized as follows. The cre- 
ative adolescents, significantly more often than the controls, viewed themselves 
as creative (imaginative, artistic, ingenious, original), independent (uncon- 
ventional, individualistic), uninhibited (spontaneous, impulsive), iconoclastic 
(assertive, outspoken, rebellious), complicated (complex, reflective, cynical, 
idealistic), and asocial (aloof). In striking contrast, the adjectives differen- 
tiating the control students present a remarkably consistent picture of passive 


conformity: dependable, cooperative, contented, conventional, quiet, and sl- 
lent. 


D. Discussion 


The results of the present study substantiate the impressive record of the 
ACL in creativity research. The high level of validity found with a number 
of ACL scales in previous investigations of creative adults (4, 13) was con- 
firmed in this study of creative adolescents. The ability of a number of ACL 
scales to identify creative adolescents of both sexes and across diverse fields 
of achievement indicates that the scales have considerable validity generaliza- 
tion. The results support the observation of Cashdan and Welsh (3) that 
creative adolescents are very similar in personality despite sex and specialty 
differences, "This similarity underscores the feasibility of Golann's (6) sug- 


gestion that personality correlates be used as criterion measures in the studY 
of creativity. 


1. The Self-Image of Creative Adolescents 


Since recent factor analyses of the ACL scales (9, 15, 18) have pointed 
to a substantial degree of overlap in the 24 scales, the results of the present 
study can be described in terms of general factors or themes. The ACL scales 
found to characterize the creative Ss in this study suggest the following main 
themes: (a) complexity and reconciliation of opposites; (5) impulsivity a? 
craving for novelty; and (c) autonomy and self-assertion. 


a. Complexity and reconciliation of opposites. The openness of the creative 
adolescent to many sides of his personality 


traits, is apparent in the finding that the cre 
scales of Number of Adjectives Checked 


irable 
, including socially undesi? 
ative Ss scored high on the 


d- 
ae : and Number of Unfavorable AC 
jectives, while they scored low on Defensiveness and Personal Adjustmen™ 
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e en with this theme are a number of the individual adjectives found 
to be distinctive of the creative $$: namely, complex, assertive, aloof, cynical, 


idealistic, impulsive, reflective, and rebellious. These adjectives suggest that 


th iv А А iP Y 
е creative person not only recognizes opposing forces in his nature, such as 


cynical vs. idealistic, impulsive vs. reflective, and aloof vs. assertive, but is 
able to reconcile and integrate them into his personality. To do this the cre- 
ative $$ must possess exceptionally strong ероз. In this connection, Barron 
(2, p. 159) has described the creative person as “both more primitive and 
more cultured, more destructive and more constructive, crazier and saner, 
than the average person." The control Ss, on the other hand, were a picture 
of calm contentment, as reflected in the adjectives that characterized them, 
contented, and quiet. 

djectives fo 
uestion as to Wl 


such as cooperative, 


The number of unfavorable a und to differentiate the creative 


and the control groups raises the q hether the creative Ss were 
simply more open and honest in their responses: that is, more open to self- 


disclosure behavior as described by Jourard (10), or whether the adjectives 
tween the Ss in the two groups. The 


reflected real personality differences be 
discrepancy between the calm, quiet, and contented self-portrait of the control 
Ss and the typical depiction of the adolescent years as stormy and stressful 
would support the former view. On the other hand, MacKinnon’s (11) find- 
ing that staff observations tend to verify ACL self-descriptions would support 
the latter theory. Further research is needed to resolve this question. 
b. Impulsivity and craving for novelty. Closely related to the complex, 
Ss is the second general theme: namely, 


conflictful self-concept of the creative 
openness to impulse expression and desire for novel experiences. This theme 


is reflected in the high scores of the creative students on the scales of Labil- 
ity, Exhibition, and Change; and their low scores on Endurance, Order, and 
Self-control. The following individual adjectives found to be distinctive of 
the creative Ss are also indicative of this theme: impulsive, imaginative, spon- 
taneous, artistic, ingenious, and original. In brief, the creative Ss see them- 
selves as continually striving for novelty in their environment, both by im- 


Pulse expression and more deliberate creative works. Rees and Goldman (17) 
apt to occur in college students who are 


also found that creativity is more 

impulsive and characterized by a lack of deliberation, self-control, and re- 

straint, They speculate that the impulsive, acting out, histrionic behavior of 

the creative person is indicative of а basic hysterical character structure. А 
eative achieve- 


motivational relationship between ор 
ment has been suggested by Maddi (14). _ 
c. Autonomy and Self-A. The third 


enness to novelty and cr 


general component of the 


ssertion. 
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self-concept of the creative adolescents is their independence of thought and 
action, coupled with a leadership role in social situations. Thus, the creative 
Ss scored high on the ACL scales of Autonomy, Aggression, Dominance, and 
Self-confidence, while they scored relatively low on Abasement, Affiliation, 
Deference, and Nurturance. The results of the item analysis showed the cre- 
ative Ss to be distinguishable by their self-image as aloof, individualistic, and 
unconventional. The creative adolescent, then, prefers to work by himself, 
although he tends to be poised, influential, and controlling in social situations. 

In view of the above, it is not surprising that an analysis of the Ss’ bio- 
graphical data revealed that more creative than control Ss were elected to 
the highest position of leadership in high school extracurricular activities (1, 
19). Similarly, the results of a factor analytic study by Furcon, Bachr, and 
Zolik (5) disclosed that creative persons tend to relate to their social milieu 
by dominating and communicating with groups, yet otherwise remaining so- 
cially aloof and demonstrating independence of thought and action. It has 
been often found that this independence is carried over into the work envi- 
ronment by a preference for highly unstructured work situations which foster 
autonomy, freedom, and an idiosyncratic approach to work activity (5, 12). 


2. Sex and Specialty Differences 


Although the four creative groups in this study tended to show similar pat- 
terns of scores on the ACL scales, there were obvious sex and specialty differ- 
ences in scale scores between the four groups. The most conspicuous variation 
was that relatively few significant differences between creative and control 
group means were found in the girls’ art field. This finding is undoubtedly 
related to the fact that the majority of creative and control girls in this field 
attended high schools that specialized in art. Thus, the girls were already a 
highly select group in regard to creativity so that the creative-control distinc- 
tion in this field was rather tenuous. 

An obvious sex difference in ACL scores was that the creative girls (as 
compared with their controls) tended to check more adjectives as self-de- 
scriptive, particularly unfavorable ones, than the creative boys (as compared 
with their controls). This difference was particularly noticeable for the girls 
in the writing field. Perhaps the simplest explanation for this finding is that 
girls tend to be more verbal than boys and one would expect girls with writ- 
ing talent to be especially verbal. Also relevant is "Trezise's (23) finding that 
creative girls tend to be more introspective and interested in self-understand- 
ing than creative boys. 
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E. SUMMARY 

The Adjective Check List was found to be a highly valid instrument in 
identifying creative high school boys and girls in different specialty fields. As 
contrasted with matched control groups, creative adolescents were found to 
have similar self-images which were described in terms of three themes: com- 
plexity and. reconciliation of opposites, impulsivity and craving for novelty 
ssertion. It was concluded that a person's тын 
in motivating him towards creative achievement. 


and autonomy and self- 
plays a significant role 
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DEVELOPMENTAL STUDY OF SOCIAL ROLE 
PERCEPTION AMONG HUTTERITE — 
ADOLESCENTS* ! 


Department of Psychology, University of Manitoba 


SHIRIN SCHLUDERMANN AND EDUARD SCHLUDERMANN 


A. INTRODUCTION 


In a previous study of the social role perception among Hutterite children, 
Schludermann and Schludermann (2) suggested, on the basis of results ob- 
tained, that as Hutterite children grow older, they may be able to make the 
kind of differentiations in their perceptions that researchers using the seman- 
tic differential are accustomed to obtaining. The present study on Hutterite 
adolescents was done with a view to testing the initial assumption that there 
аге a few basic dimensions underlying Hutterite adolescents’ perception of 
social roles in their communal society. The second part of the assumption was 
that the basic dimensions would emerge as clearly differentiated dimensions 


which would indicate a certain degree of change in the developmental differ- 
t is generally assumed that an older person is more 
eptual fields than a younger 
(1)] have demonstrated the 


entiation in cognition. 1 
capable of making finer differentiation in perc 


person. Werner (3) and Piaget [see Flavell 


validity of such a developmental change in cognitive differentiation. 


B. METHOD 
1. Subjects 


One hundred and twenty-eight adolescents were selected from the public 
schools in 20 Hutterite colonies of Manitoba. The subjects were 14 to 16 
years old and consisted of only those adolescents who were average or above- 
average in their abilities. The selection was made on the basis of their school 
achievement records and teachers’ evaluation. Extremely dull and outstanding 
he selection so that the effects of extreme differ- 


ences in intelligence could be somewhat controlled. The Hutterite colonies 


i i itori Provi j ssachusetts, on May 16, 1969, 
* Received in the Editorial Office, Provincetown, машын АА Сару by 


and published immediately at 35 New Street, Worcester, 
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are located in rural areas and form part of the socioeconomic homogeneous 


Hutterite communal society. 


2. The Instrument, Concepts, and Scales 


The description of the semantic differential scales, concepts, and their 
method of selection were kept the same as in the previously reported study 
(2). Therefore, they are not repeated here. The investigative procedure was 
also the same as in the previous study. 


C. RrsurLTS 
The sample was subdivided into two groups of girls (N = 76) and boys 
(N = 52). Correlation matrices of 15 X 15 scales for the two subgroups 
were calculated separately and factor analyzed by the principal axes method. 
The principal axes matrices obtained for the two groups are similar to 
those obtained for Hutterite children. Table 1 shows that only the first fac- 


TABLE 1 
PROPORTION OF VARIANCE ACCOUNTED FOR BEFORE ROTATION 


Cumulative 


percent 

Sex Ages F, F, Fy F, variance 
Girls 14-16 yrs. 30.71 Е 11.96 7.09 637. 56.13 
Boys 14-16 yrs. 32.25 13.69 7.12 6.48 59.54 


tor in both groups accounts for one-third of the total variance. The second 
factor accounts for 12 to 13 percent and the third factor accounts for seven 
percent variance. "Thus the first three factors account for approximately only 
50 percent of the total variance. It is clear that with the exception of the 
first factor, the rest of the factors are specific factor variances. The first fac- 
tor, which qualifies as a common factor variance, appears to be an undiffer- 
entiated global factor. "Twelve of the 15 variables have loadings higher than 
2.40 on the first factor of the girls’ matrix, and ten of the 15 variables have 
similar loadings on the first factor of the boys’ matrix. Thus the dominant 
dimension appears to be essentially “good-bad,” and any person who is Inter- 
esting, Brave, Busy, Bright, Useful, Pleasant, etc. is a “good” person. The 
second factor consists of four variables: important, friendly, clean, and active; 
whereas the third factor has a high loading only on one variable in both 
groups. The reader will be impressed with the similarity of factor matrices 
obtained across age groups of Hutterite children eight years to 13 years (in 


previous studies), and adolescents 14 to 16 years in the present study. 


е 
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D. Discussion 
]. Factor Structure 


It was suggested previously (2) that such low common factor variances 
may be due to several reasons. One is that the subjects did not understand 
the instructions clearly; and a second is that the subjects had only a vague 
understanding of the descriptive terms. These two sources of error may be 
expected to be less influential in the case of the adolescents than in the case 
The task itself was kept simple enough for a child to follow, 
and the descriptive adjectives were carefully selected so that a child as young 
as eight years old would be able to understand the meaning of the words. 
Therefore, it may be assumed that these two sources of error which might 
ariances in the case of the children, would 
not necessarily have had such drastic effects in the case of the adolescents. A 


third source of error variance was suggested, and that is the peculiarities of 
Hutterite children and adolescents grow up. 
ised and limited 


fore quite likely 


of the children. 


have reduced the common factor v 


the social environment in which 
Hutterite adolescents! social interactions are strictly superv 
mbers of a few other colonies. It is there 
asked to describe certain social roles in 
in terms of the two or three other 
y further tend to think of these 
han in terms of 


to visits with me 
that the. Hutterite adolescents, when 
their colonies, may tend to think more 
farm-foremen or pigmen they know. They ma 


individuals in terms of their personal characteristics rather t 
and role. It is very likely, therefore, that the peculiarities of 
to exert a very strong influence on the direction and 
adolescents will be able to direct their thinking 


characteristics of their envi- 


social function 
social environment tend 


degree to which Hutterite 
and perception of certain relevant or irrelevant 


ronment. 
2. Social Perception 


In terms of the number of соп actors that may be used as indices of 
cognitive differentiation (i.¢., the greater the number of common factors, the 
greater the cognitive differentiation), Hutterite adolescents fail to show any 
clear differentiation of social roles. The Hutterite adolescents’ undifferenti- 
ated social perception is not much different from that of the Hutterite chil- 
dren. It may be asked as to why the Hutterite adolescents’ cognitive growth 


fails to show the expected concomitant changes towards cognitive differentia- 
ber that Hutterite children and 


tion. Here again, it is important to remem 
adolescents have to live in a society which has deliberately abolished all signs 


of status symbol and which offers very little positive reinforcement for mak- 
1 > H e + . B BJ 7, " 
ing distinctions between social roles, such as superior-inferior, etc. Very often, 


amon Ё 


246 JOURNAL OF PSYCHOLOGY 


there is considerable negative reinforcement among the Hutterites against at- 
tempts to make such distinctions outwardly, and to publicize openly through 
dress, residence, and mode of behavior, one's superior status. It appears then 
that in Hutterite society failure to obtain developmental differentiation in the 
social role perception of Hutterite adolescents is very likely due to the pow- 
erful social influences that shape the growing personality of the young ones. 
If this assumption is true, then one may expect to find similarly undifferenti- 
ated perception of social roles even among the Hutterite adults. 


E. SUMMARY 


One hundred and twenty-eight Hutterite adolescents, 14 to 16 years of 
age, were selected from 20 colonies in Manitoba. The adolescents described 
the same 16 social roles on 15 semantic scales as the Hutterite children did. 
The two correlation matrices of 15 X 15 for two sex groups were factor an- 
alyzed separately. There was only one common factor which accounted for 
one-third of the total variance. The principal factor matrix of the adolescents 
was almost as undifferentiated as that of the children. "The assumption of a 
few basic dimensions underlying adolescents’ perception, as well as the as- 
sumption of increased cognitive differentiation in social role perception of 
adolescents, were not supported. 
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RAVEN'S PROGRESSIVE MATRICES: MORE 
ON CONSTRUCT VALIDITY* 


East Orange Veterans Administration Hospital and Rutgers—The State University 


Henry R. Burke AND WILLIAM С. BINGHAM 


А. INTRODUCTION 


A previous study (2) cited evidence that Raven’s Progressive Matrices had 
validity as a measure of general intellectual functioning. It loaded signifi- 
igence in an analysis of the intercorrelations 
Cohen’s factors (3, +), on the 
General Classification 
(N = 39) interpreta- 


cantly on a factor of general intell 
of scores on the Progressive Matrices, on 
WAIS (7), and on the separate parts of the Army 
Test (1). Because of the small sample size in that study 
tions had to be highly tentative. 


B. PROCEDURES 


For the present study, the senior author himself accumulated 52 additional 


protocols, using the same tests. The Progressive Matrices was administered 
without a time limit. The other tests were administered in standard fashion. 
The WAIS was scored also for Cohen’s factors (Verbal Comprehension, 
Memory, and Perceptual Organization). The AGCT was scored separately 
for Vocabulary, Arithmetic, and Block Counting. M 

All subjects were male patients at the East Orange Veterans Administration 
Hospital who were referred to the Psychology Service for vocational coun- 
seling. Ages ranged from 19 to 59 with a mean of 35.1 and 8D 8.7. The 91 
sets of test scores were intercorrelated and subjected to a principal components 
n electronic data processing equipment at the 


analysis and a varimax rotation 0 à Suis 
ation Services, Rutgers University. 


Center for Computer and Inform 
C. FixpiNGS AND Discussion 
1. Reliability of the Progressive Matrices 


g in this study was the correlation be- 
Matrices: r = .92. 
e reliability of 


Perhaps the most surprising findin 
tween the 91 pairs of odd-even scores on the Progressive № 


Application of the Spearman-Brown formula for predicting th 
a test of double length (5, p. 194) yielded a reliability coefficient of 96. 


ssachusetts, on May 20, 1969, 
Мазза Massachusetts. Copyright 


* Received in the Editorial Office, Provincetown, 
and published immediately at 35 New Street, Worcester, 


by The Journal Press. 
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TABLE 1 
INTERCORRELATIONS OF T'EST SCORE VARIABLES 
(N = 91) 
Variables 1 2 3 4 5 
1. WAIS Verbal* 
2. WAIS Perf.* 70 
3. WAIS Total* 95 90 
4. AGCT Total** 80 71 82 
5. Raven PM** 65 76 75 67 
M 60.9 45.3 106.1 104.0 
SD 14.2 10.4 22.8 19.6 11.5 


Note: Decimals omitted. 
* Scaled scores. 
** Raw scores. 


2. Concurrent Validity 


In this study, the Progressive Matrices, as an apparently nonverbal and 
untimed measure of "intellectual capacity" (6, p. 1), correlated slightly better 
(Table 1) with the WAIS Performance (7 —.76) than did either the 
WAIS Verbal (r = .70) or ће AGCT Total (r = .71). The Raven PM 
did not correlate quite as well with the WAIS Total (r = .75) as did the 
AGCT Total (r = .82). This fact seems to be attributable to the highly 
verbal character of both the WAIS and the AGCT: WAIS-Verbal and 
AGCT Total, r = .80. As further evidence, Table 2 gives a paired compar- 
ison of the correlations of the Raven PM and AGCT scores with the WAIS 
subtest scores. However, the Raven PM correlated about as well with the 
WAIS Verbal (r = .65) and the WAIS Performance (r = .76) as these 
two major parts of the WAIS did with each other (r = .70), as seen 1n 
Table 1. 


TABLE 2 
INTERCORRELATIONS OF WAIS SUBTEST SCALED SCORES WITH 
Raven PM anp АССТ Тотал, Scores 


(N = 91) : 
WAIS subtests Raven PM AGCT 
Information 56 69 
Comprehension 50 61 
Arithmetic 43 61 
Similarities 59 69 
Digit Span 47 54 
Vocabulary 55 70 
Digit Symbol 51 52 
Picture Completion "m 59 
Block Design 60 E 
Picture Arrangement 50 15 
Object Assembly 62 E | 


Note: Decimals omitted. 
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TABLE 3 
INTERCORRELATIONS OF TEST SCORE VARIABLES AND AGE 
(N = 91) 
Variables 1 2 3 + 5 6 7 8 
1. Age 
2. Cohen V 03 
3. Cohen M —0+ 63 
+. Cohen PO —11 51 52 
5. AGCT V 06 77 60 56 
6. AGCT N —14 72 69 56 81 
7. AGCT BL —18 +5 +5 59 58 68 
8. Raven PM —32 59 49 61 60 66 55 
M 35.1 10.3 9.8 9.8 26.7 28.2 25.5 41.3 
SD 8.7 2.8 2.7 2.7 9.0 9.1 3.5 11.5 


Note: Decimals omitted. 

The intercorrelations in this study (N = 91) of the Raven PM with the 
WAIS and AGCT, and of the WAIS with the AGCT, are somewhat less 
remarkable than the same correlations (N = 39) in the previous study (2) : 
Raven PM and WAIS, r=.75 es. .85; Raven PM and АССТ, r 2.67 
vs. .70; WAIS and AGCT, r = .82 vs. .84. 


3. Construct Validity 
There appeared to be too many variables affecting the intercorrelations 
between the Raven PM, the WAIS Verbal, the WAIS Performance, and 


the AGCT (Table 1) to explain their relationships in terms of a single gen- 


eral factor of intelligence, if one applied the Spearman criterion of zero tetrad 


differences (5, pp. 248 #.). 
However, when the intercorrel 
Cohen's WAIS factors, the AGCT part scores, and age were submitted to 
a principal components analysis, all variables except age loaded heavily (Table 
4) on a single factor of general intellectual functioning. It appears that all 


ations (Table 3) among the Raven PM, 


TABLE 4 
PRINCIPAL COMPONENTS ANALYSIS ОЕ VARIABLES 
(N —91) 

Variables A B € р Е 
Аве —13 53 19 —05 —08 
Cohen M 80 29 —15 12 —09 
Соһеп М 7+ 15 —19 —31 06 
Соһеп РО 72 —16 13 —13 —26 
AGCT V 86 26 11 15 03 
AGCT N 90 02 —02 02 22 
AGCT BL 72 —25 31 —04 10 

—14 12 —11 


Raven PM 76 —29 
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of these tests, including the Raven PM, are somehow, in spite of their dif- 
ferences, all measures of general intelligence. 

The same matrix of intercorrelations (Table 3) was submitted to varimax 
rotation, and five factors were identified ( Table 5). Since the sample size was 
relatively small (N = 91), it seemed appropriate to employ a rigorous cri- 
terion for loading. On that basis a cut-off of .40 was employed in determining 
factor content. 

On Factor A, the only variables which did not load were age (.03), Cohen's 
Perceptual Organization (.34), and AGCT Block Counting (.31). Although 

TABLE 5 


ROTATED FACTOR MATRIX 
(Number of Iteration Cycles 6) 


Factors 

Variables A B С р Е 
1. Age 03 —59 —07 —01 —04 
2. Cohen V 79 —03 16 27 24 
3. Cohen M 47 05 22 62 19 
4. Cohen PO 34 15 41 23 $3 
5. AGCT V 76 —09 43 19 18 
6. AGCT N 66 18 49 36 08 
7. AGCT BL 31 21 68 16 24 
8. Raven PM 53 H 28 16 36 


Note: Decimals omitted. N = 91. 
age clearly did not contribute, the other two barely missed meeting the crite- 
rion. Factor A also appeared to be a general factor of intellectual functioning, 
predominantly verbal. 

Factor B loaded only on age (—.59) and the Raven PM (.44), and ap- 
peared to be an age factor. This finding supported that of our earlier study 
(2), and Raven’s contention of a negative relationship between scores on his 
test and age. 

Factor C loaded on Cohen’s Perceptual Organization and all of the AGCT 
parts. Since these four tests were all timed, and the AGCT parts highly so, 
this factor seemed to be clearly a time or speed factor. 

Factors D and E appeared to be similar to Cohen’s Memory and Perceptual 
Organization Factors, respectively, since those variables were the only ones 
which loaded on these two factors. It is interesting to note that the Raven PM 
missed loading (.36) on Factor E (Perceptual Organization), and, accor- 
dingly, as in our previous study (2), did not appear to be uniquely or clearly 

significantly a measure of this Cohen factor, as we hypothesized it might be. 
Possibly Factor D (Cohen’s Memory Factor) might need to be thought of as 
a Number factor. 


f~ 
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Although Factor A, which we regard as a general factor of intellectual 
functioning, was highly verbal, the Raven Progressive Matrices loaded on it. 
One might speculate as to what extent performance on the Raven PM de- 
pended on verbal ability. The investigators have noted that some subjects, 
especially оп the more difficult items, appeared to "talk their way" through 
the Progressive Matrices: that is, they seemed to explain to themselves ver- 
bally what the relationships were. It might be interesting to investigate this 
kind of phenomenon more systematically. 

A varimax rotation, in summary, showed that the Raven Progressive Ma- 
trices was positively related to a general factor of intellectual functioning, 
even though it was highly verbal, negatively related to an age factor, unre- 
lated to speed and memory factors, and no more than minimally related to a 


perceptual organization factor. 
D. SUMMARY 


(a) The untimed Raven Progressive Matrices showed a surprising split- 
half corrected reliability coefficient of .96 (№ = 91). (2) Both a principal 
components and a varimax factor analysis indicated that the Raven Progressive 
Matrices loaded significantly on a factor of general intellectual functioning, 
even when it was heavily determined by verbal content. (c) Performance on 


the Raven Progressive Matrices was negatively related to age. 
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SCALE CHECKING STYLE AS A FUNCTION 
OF AGE AND SEX IN INDIAN AND 
HUTTERITE CHILDREN* ! 


Department of Psychology, University of Manitoba 


SHIRIN ScHLUDERMANN AND EDUARD SCHLUDERMANN 


А. INTRODUCTION 
Werner (16) emphasized that a young organism in the process of devel- 


ased differentiation, hierarchization, and integration. 
of differentiation with special reference 


to cognitive and mental development. Developmental psychologists have en- 


deavored to enunciate and define a variety of criteria whereby differentiation 


in children and adults may be studied. The development of cognitive dif- 
rea where a considerable number of 


ia goes through incre 
erner had proposed the principle 


ferentiation in young children is one such a 
Criteria and hypotheses are generated and tested (0.2, 2;:3;,6; 7,8; B 10, 
17, ete.). Numerous devices have also been developed to test or assess the 


development of differentiation in young children. It has frequently been ob- 
rating scales—the direction and lo- 


эёмей that in psychometric їезїз—©.Р.› : 
cation of responses are partly a function of the scales used, objects rated, as 
well as the checking style of the subject (e.g, 11). The intent of this paper 
is to study the relationship between the аре and sex of the respondent and 


checking style on a rating scale—i.c« the semantic differential—in two in- 
dependent samples of societies which differ greatly in language and/or social 


Systems from the modern western society. 
"The semantic differential can be d with young children and has been 
Shown to be a reliable instrument (e.g. + 5, 12). Osgood, Suci, and Tan- 
Nenbaum (13) have demonstrated that scale checking styles on semantic dif- 
ferential scales are influenced by a number of variables: e.g., intelligence, age, 
anxiety, mental disorder of the respondents. It is also a well known fact that 
stiating between complex stimuli than 


а younger child is less capable of differer de 
ап older child or adult. On the basis of Werner's principle of differentiation, 


it may be stated that a younger child shows relatively less cognitive differen- 
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tiation than an older child. Among several indices that may be used to study 
greater ог lesser cognitive differentiation, scale checking style may be con- 
sidered to be one of the indices. "Therefore, it may be hypothesized that younger 
children will tend to respond more often on the extreme ends of a rating scale 
(semantic differential) than the older children. The older children, on the 
other hand, will tend to respond more often on the middle position or neutral 
position of the scales than will the younger children. It may also be assumed 
that early social orientation may influence differently the cognitive styles of 
boys and girls. It is assumed, further, that the relations observed in non- 
American societies between age and sex of the respondents and their scale 
checking styles would be similar to that already observed in the U.S. samples. 


B. METHOD 


'The data reported here were obtained as part of large cross-cultural in- 
vestigations on the social role perceptions and differentiations among chil- 
dren (see 14, 15). 

The semantic differential scales (form 11) were used, as described by 
Osgood et al. (13). In order to minimize error variance, role concepts, scales, 
and method of administration were kept constant in two independent samples. 
One of the samples consisted of Ss at two discrete age levels: i.c., eight- and 12- 
year-old children (Indian sample); the second sample consisted of Ss on à 
continuous age range of eight to 13-year-old children (Hutterite sample). 
The five-point scales were used by all the Ss. 


1. Subjects 


Two independent samples of children were selected from North-Central 
India and a communal society in the Canadian Prairies. The sample from 
the northern provinces of India consisted of 430 Hindi-speaking children who 
were eight and 12 years old at the time of testing. Because of the fact that 
intelligence tests in Indian schools ате not used widely, children's ЈО esti- 
mates were based on teachers’ ratings and Ss’ academic records. The Ss with 
extremely low or high /Qs were eliminated so that such extreme differences 
in [Qs would not unnecessarily bias children's responses on the semantic 
scales. Information regarding the father’s occupation and income provided the 


basis for estimates of socioeconomic status in the Indian sample. The majority 
of children belonged to the Indian middle class. 


The Canadian Prairie sample came from 20 Hutterite colonies in rural 
Manitoba. The Ss were eight to 13-year-old boys and girls currently enrolled 


4 


P- 


„ ы 


S. SCHLUDERMANN AND E. SCHLUDERMANN 255 


ш the public schools in Hutterite communal colonies. "Three hundred and 
fifty-nine Ss were selected on the basis of their school achievement records 
and teachers’ ratings. The children ranged in their abilities from average to 
somewhat above-average. In order to control somewhat for the effects of ex- 
treme differences in intelligence on scale checking style, very dull and out- 
standing students were eliminated from the sample. Socioeconomically, the 
Hutterite colonies form a homogeneous social group located in rural areas. 


2. Semantic Differential Scales 

A set of 15 bipolar scales were selected which tended to have high factor 
loadings on at least one of the evaluative (E), potency (P), and activity (A) 
d by Osgood eż al. (13). Further, selection of scales was 
, representational character, and empirical vari- 


ries of successive empirical tests before 
he polarity direction 


dimensions, as reporte 
based on relevance, simplicity 
ance. The scales were subjected to a se 
the final selection was made. Care was taken to alternate t| 
of scales so as to prevent formation of positive preferences. Hindi equivalents 
of the English words in the semantic scales were obtained from a number of 
independent bilingual judges in India. The judges were asked first to provide 
the English equivalent Hindi qualifiers and, next, to give their opinion on the 
Hindi qualifiers provided by others. The Hindi qualifiers which received the 
highest degree of consensus as being the best equivalents to the corresponding 
English words were finally selected. 

The scales so selected were as follows: we 
slow-fast, interesting-boring, 
less, sad-pleasant, soft-hard, 


active-quict, small-large. 


ak-strong, important-common, 
afraid-brave, busy-lazy, dull-bright, useful-use- 
unfriendly-friendly, gentle-rough, dirty-clean, 


3. Concepts 
ere as follows: (4) Family roles (males) : grand- 
females) : grandmothers, moth- 
onal): doctors, teachers, presi- 
waitresses, cowboys; (e) 


The social role concepts w 
fathers, fathers, brothers; (^) Family roles ( 
ers, sisters; (c) Occupational roles (professi 


dents; (d) Occupational roles (services) : janitors, 
boy-friends, girl-friends. 


Self-Concept: me; (f) Friends: | 
The bilingual judges also offered the concept role equivalents in Indian 
culture. In the Hutterite communal sample, the following adaptations were 
made for the occupational role concepts: (а) Occupational roles (males) : 
farm-foremen, pigmen, doctors; (b) Occupational roles (females) ; kitchen- 
women, garden-women, kindergarten teachers. 
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C. RESULTS 


Responses were scored for extreme—1 and 5—middle—2 and 4—and neu- 
tral—3—scale positions. These scores were summed for all the adjectival pairs 
over all concepts to provide "style" scores of an individual respondent. Sum- 
ming overall subjects" three separate style scores provided the group scores of 


TABLE 3 А 
ANALYSIS OF VARIANCE: INDIAN CHILDREN’S RESPONSE CHECKING STYLE 
(Ages eight and 12 years) 


Source df $$ MS F р 
Between 429 3,065.81 
Sex 1 57.84 57.84 8.61 05 
Age 1 145.85 145.85 21.74 01 
Sex X Age 1 4.04 4.04 .60 NS 
Error (bet.) 426 2,858.08 6.71 
Within 860 131,566.50 
SCORE 2 88,277.88 44,138.94 1,051.18 .001 
SCO X Sex 2 1,058.49 529.25 12.60 01 
SCO X Age 2 6,292.09 3,146.04 74.92 01 
SCO X Sex X Age 2 161.27 80.63 1.92 NS 
Error (within) 852 35,776.77 31.99 


extreme (E), middle (M), and neutral (N) scale positions. "Tables 1 and 2 
indicate the checking style scores for the Indian and Hutterite samples respec- 
tively. 

The trend is clear in both the tables, indicating that younger-age children 
check the extreme scale positions relatively more often than they check the 
middle or neutral positions. This tendency seems to change steadily (Table 2) 
with age. As children grow older, they tend to check increasingly the middle 
positions. For the two age groups of eight- and 12-year-old boys and girls, 
there is an almost exact replication of response checking scores in the two 
different and independent samples. This clear trend was tested for signifi- 
cance by a complex analysis of variance design. The raw scores were trans- 
formed by the square root formula VX + Ж I. The variables in the 
analysis of variance consisted of three style scores X two sexes XX two ages 
(or six ages for the Hutterite sample). The summary tables employing this 
complex analysis are presented in Tables 3 and 4. The differences between 
three style scores are found to be highly significant at p < .001 level. There 
is also a significant score X age interaction (p < .01) in both the samples. 
Thus, so far, influence of age on response checking style is found to be sig- 
nificant in two widely different social-cultural groups. 

In the case of the Indian sample, there is also a significant interaction effect 


чу, 
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(5 < .01) between score and sex. Such an effect is missing in the Hutterite 
sample. Moreover, there are significant age difference effects (p < .01) be- 
tween the two discrete age groups of eight and 12 years in the Indian sample. 
There is also a somewhat significant sex effect (№ < .05) among Indian 
respondents, 


TABLE + 
ANALYSIS OF VARIANCE: HUTTERITE CHILDREN'S RESPONSE CHECKING STYLE 
(Ages eight years to 13 years) 


Source df SS MS F ГА 
Between 358 6,309.34 
Ѕех 1 7.18 718 Al NS 
Age 5 243.55 48.71 2.79 NS 
Sex X Age 5 6.69 1.34 .007 NS 
Error (bet.) 347 6,051.92 1744 
Within 718 106,477.69 
SCORE 2 67,973.81 33,986.91 665.36 .001 
SCO X Sex 2 257.17 128.58 2.52 NS 
SCO X Age 10 2,518.58 251.86 4.93 01 
SCO X Sex X Age 10 279.66 27.97 55 NS 
Error (within) 694 35,448.47 51.08 


D. Discussion 


The results support the stated hypothesis that older children make increas- 
ingly greater use of middle scale positions, while the younger ones check 
the extreme positions more often than do the older ones. Thus, there is evi- 
dence of increased cognitive differentiation with increase in developmental age. 
It is clearly understood here that age per se is only a convenient yardstick to 


mark the developmental changes in cognitive differentiation. To this extent, 
ated that there is a progression from 


Werner’s (16) hypothesis is well substantia d thi RH 
lesser differentiation to increased differentiation ın cognitive growth. It is all 
al cognitive differentiations could 


the more remarkable that these development 
be studied in widely different culture groups by means of the same psycho- 


metric instrument (semantic differential). 

Light and Zax (11) reported highly similar changes in response checking 
style with changes in developmental ages of children. Osgood е? al. (13) have 
offered further evidences of the same sort. To this extent, the third hypoth- 
esis is also supported that changes in response checking style with age are 
similar in Indian, Hutterite, and American subjects studied. The second 


d to the influence of sex differences on response checking 
Indian respondents show 


utterite respondents do 


hypothesis pertaine 
styles, The hypothesis is partly supported insofar as 


a significant difference in this regard, whereas the H 
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not show such significant differences. Here, it is interesting to note that in a 
number of studies dealing with American subjects, the sex differences were 
found to be significant (e.g., 1, 13), and in some others they were not sig- 
nificant (e.g., 11). It may be mentioned here that Osgood et al. (13) inter- 
preted the data as evidence for three difficulty levels: i.e. all-or-nothing, 
“black-white” decision being the easiest, intermediate level of difficulty is the 
"neither" (neutral) judgment, and the most difficult level is presented by 
the modified scale positions—e.g., 2 and + in five-point scale—which requires 
respondents to make finely graded judgments. The data presented in this 
study also seem to be consistent with the Osgood e al. interpretation. 


E. SUMMARY 


А sample of 430 children from India and another sample of 359 Hutterite 
children from Manitoba were studied to determine the possibility of a rela- 
tionship between age and sex of the children and the tendency to use more 
frequently the extreme—1 and 5—or middle—2 and 4—or neutral—3—scale 
positions of the five-point semantic scales. Frequency counts were made for 
the number of times a subject made an E, M, or N rating. The group means 
and SD were obtained for E, M, and N ratings and were tested for significant 
differences by a complex analysis of variance design. The age effects in rela- 
tion to E, M, or N ratings were found significant in both samples. Significant 
sex difference effects were obtained in the Indian sample but not in the Hut- 
terite sample. The results are discussed in the context of Werner's hypothesis 
of cognitive differentiation, as well as the Osgood et al. interpretation of three 
difficulty levels involved in the judgmental process. 
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А COMPARISON OF VOLUNTEERS AND NONVOLUNTEERS 
FOR ASTRONAUT TRAINING*? 


Department of Psychology, Ohio University and Naval Acrospace Medical Institute 


L. К. WATERS AND ROSALIE К. AMBLER 


A. INTRODUCTION 


Many projects involving performance in hostile environments utilize vol- 
unteer personnel. Studies by Ambler, Berkshire, and O'Connor (2), Bair 
and Gallagher (3), and Gunderson (5) suggest volunteers for such projects 
are superior to nonvolunteers on several measures of ability, performance, 


and motivation. However, for continuing programs it is necessary to main- 


tain a rate of volunteering and volunteer quality over an extended period of 


time to provide an adequate input of personnel to the program. 

Since 1959 all naval aviation students successfully completing the preflight 
course have received a form offering them the opportunity to volunteer for 
future duties involving space flights. They are instructed to return the form 
within one week if they wish to volunteer. No guarantee is given that they 
will be selected for such duties, and they must confirm their volunteering at 
further specified points in training if they wish to continue to be considered 
as a volunteer. Early in the project, Ambler, Berkshire, and O'Connor (2) 
compared 335 volunteers and 914 nonvolunteers (pooled across all types of 


flight students) on scores on aptitude tests taken for entry into the program, 
peer ratings of officer potential, and rate of attrition 


up was superior on all measures. 
investigation to determine (a) if dif- 


ferences between volunteers and nonvolunteers continued to exist after the 
project had been in operation for approximately four years, and (b) if any 
differences found were consistent across the various types of students under- 


going training. 


preflight course grades, 
from flight training. The volunteer gro 


'The present study was а follow-up 


B. METHOD 


The data for this study were based on the records of 2252 students (990 
officer and 1262 nonofficer) who completed preflight training during a sin- 
Massachusetts, on May 26, 1969, 
Massachusetts. Copyright by 
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gle year. 'The data were divided into six groups as follows: Nonofficer: 1) 
Naval and Marine Aviation Cadets (commissioned at end of flight training), 
2) Aviation Officer Candidates ( AOC's are college graduates who have not 
participated in NROTC; they are commissioned at the end of the 16 week 
preflight training course); Officers under Instruction (ОГ): 3) Naval 
Academy Graduates, 4) NROTC Graduates, 5) Marine Officers, 6) 
“Other” Officers (procured from other sources—e.g., Newport Officer Can- 
didate School, Coast Guard, Merchant Marine Academy). 

For each group, volunteering information, screening test scores, academic 
grades, and data on attrition from flight training were obtained. In addition, 
peer ratings on military leadership potential were obtained for the nonofficer 
groups. 


C. RESULTS AND Discussion 


Table 1 presents the percentages of volunteers in the various groups. 
There was a marked difference between the officer and nonofficer groups in 
volunteering rate. Officers volunteered significantly (p < .01) more than 
nonofficers; Naval Academy graduates showed the highest rate of volunteer- 
ing, while Aviation Officer Candidates ( AOC's) had the lowest rate. ‘These 
two groups were also high and low in the Ambler, Berkshire, and O'Connor 
(2) study. The greatest change in volunteering rate compared to the earlier 
study was the increased rate for NROTC graduates (24 percent to 33 per- 
cent) and Marine officers (25 percent to 39 percent). The overall volun- 
teering rate was almost identical to that found earlier (27 percent vs. 26 
percent). 

'Table 2 gives the means and standard deviations of scores made by as- 


z TABLE 1 
PERCENTAGE OF NAVAL FLIGHT STUDENTS VOLUNTEERING FOR POTENTIAL ASTRONAUT DUTY 
Student type Number uns Volunteers % Volunteers 
Nonofficer 
Cadet 723 163 22.6 
Aviation Officer Candidate 539 92 174 
Total Nonofficer 1262 255 20.2 
Officer 
Naval Academy Graduate 203 82 40.4 
NROTC Graduate 249 83 33.3 
Marine Officer 366 141 38.5 
Other 172 32 18.6 
Total Officer 990 338 34.1 


Total combined 2252 593 26.5 u 
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M TABLE 2 
Mean Comparisons OF VOLUNTEERS AND NONVOLUNTEERS ON SCREENING TEST SCORES 
Student Volunteers Nonvolunteers 
_ type Variable x SD х SD 
Cadets AQT 5.89 149 5.76 1.53 
кө ЕАК 6.58 1.30 6.37* 2+ 
AOC AQT 647 1.59 5.89** 1.51 
FAR 6.67 1.31 6.18** 1.20 
OI 
Academy AQT 7.61 1.27 7.198: 1.28 
ЕАК 6.15 EZ] 5.50** 1.56 
ОТ 
NROTC AQT 7.66 1.25 7.28* 1.38 
FAR 6.84 147 6.40* 1.69 
OI 
Marine AQT 6.78 1.52 6.01** 147 
FAR 6.37 1.52 6.09 1.34 
OI 
Other AQT 647 145 6.49 1.48 
FAR 5.80 1.47 6.14 146 


* Difference between volunteers and nonvolunteers significant at № < .05. 

** Difference between volunteers and nonvolunteers significant at № < .01. 
tronaut volunteers and nonvolunteers on the Aviation Qualification Test 
(AQT) and the Flight Aptitude Rating (FAR). These are tests taken at 
the time of application for naval air training. The AQT is a test of general 
ability or intelligence, and the FAR is a composite score derived from tests 
of mechanical ability, spatial ability, and biographical data. The AQT has 
been shown to be related to ground school grades, while the FAR has shown 


Predictive validity for flight training performance (1). Volunteers for astro- 
naut duty scored significantly (^ < .05) higher than nonvolunteers in eight 


of 12 comparisons. 

The first row for each student type in Table 3 presents the data for the 
Preflight final grade. Volunteer means were significantly (p < .05) higher 
than nonvolunteer means for all student types. Peer rating means and stan- 
dard deviations are given in the second. row of the Cadet and AOC groups. 
This variable is a score based on student nominations of their classmates for 
leadership ability and has demonstrated validity for predicting future pilot 
Performance (4). The mean peer rating was significantly (5 < .05) higher 
for volunteers in both the Cadet and AOC groups. 

Table 4 shows the attrition from flight training for the volunteer and non- 
Volunteer groups. Preflight attrition cases were not included, since these stu- 
dents did not have an opportunity to volunteer. In all except the Cadet 
£roup, volunteers had smaller attrition rates than nonvolunteers. No reason 
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TABLE 3 
MEAN COMPARISONS OF VoLUNTEERS AND NONVOLUNTEERS ON PREFLIGHT PERFORMANCE 
m Volunteers Nonvolunteers 
Studen! RA rar 
type Variable X SD X SD 
Cadets Preflight final 51.20 472 50.28* 4.67 
Peer rating 51.90 10.53 50.18* 9.72 
AOC Preflight final 53.61 4.84 оре 473 
Peer rating 55.37 9.93 51.0 ў 
OI Е nee 4.96 
Academy Preflight final 57.18 4.97 54.62 
OI 9 
NROTC Preflight final 58.60 418 55.92** 5.0 
x) 31** 5.65 
Marine Preflight final 55.45 5.52 50.3 . 
OI = m 
Other Preflight final 54.16 4.85 47.50 | 8. 


* Difference between volunteers and nonvolunteers significant at $ < .05. 
** Difference between volunteers and nonvolunteers significant at № < -01. 


for the marked difference between cadets and the other groups in the attri- 
tion rates of volunteers and nonvolunteers is available. 

In general, volunteers for astronaut duty were superior to nonvolunteers 
on measures of ability, early performance in training, and completion rate. 
These data support the results of Ambler, Berkshire, and O'Connor (2) ob- 
tained during the early phases of the volunteering project and indicate that 


volunteering rate and volunteer quality are maintained over extended time 
periods. 


D. SUMMARY 


After a program of offering flight students the opportunity to volunteer 
for astronaut training had been in effect several years, volunteers and non- 


TABLE 4 
ATTRITIONS FROM FLIGHT TRAINING FOR VOLUNTEERS AND NONVOLUNTEERS 
Volunteers Nonvolunteers 
Student type N 96 N 9b 
Cadet 52 31.9 140 25.0** 
Aviation Officer Candidate 23 25.0 153 34.289 
Naval Academy Graduate 13 15.8 20 16.5 
NROTC Graduate 16 19.2 36 214 
Marine Officer 24 17.0 52 23.1** 
Other Officer 9 28.1 46 32.9 
'Total 117 23.1 47 27.0 


** Difference between percentage attritions for volunteers and nonvolunteers sig- 
nificant at р < .01. 
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volunteers were compared on selection test scores, preflight performance, and 
attrition from flight training to determine if volunteer quality was main- 
tained. In general, volunteers were higher on ability and preflight perfor- 
mance measures than nonvolunteers. Attrition data were inconsistent. Vol- 
unteering rate was the same as that found during the initial stages of the 


Program, 
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QUANTITATIVE BEHAVIOR OF WARRING NATIONS* ! 


Department of Psychology, Stanford University 


Јонх Vorvopsky* 


A. INTRODUCTION 


Warfare is demonstrated to be an orderly and repetitive behavioral event. 


During the last 100 years, the United States has fought four major wars 
The Civil War, World War I, World War II, and the Ko- 
t years, we have been engaged in a fifth and not 
has occupied our minds, our 


to conclusion: 
rean War. For the past eigh 
yet concluded war, the war in Vietnam, which 
troops, and our resources longer than any other w 

And yet, despite its record duration, the Vietnam W 
in the same orderly manner as our four previous 
avioral patterns of nations at war, 
during the last 


ar in our history. 
ar appears to be pro- 


gressing, as of this writing, 
wars. An inquiry into the repetitive beh 
of the United States, our allies, and our enemies 
100 years, reveals that we and the enemy are acting today in the same way as 
d in the past. Because of this orderliness of our behavior, it now 
a crisis point in the Vietnam War where 


Particularly 


we have acte 


appears that we may be approaching 
another major escalation is indicated. 


either a settlement is possible or 
tnam War and the previous wars 


This orderly manner in which the Vie 


an examination of the phenomena—the vital sta- 


have progressed is based on 
tistics—of modern armed conflict by means of cyberne 
an identification and demonstration of the orderly progress of warfare that is 


the primary concern of this paper, ther with the derivation of equations 


atically the rel 


tic analysis. Thus, it is 


toge 
that express mathem ationships among the vital statistics of 


Warring nations. 
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The present investigation is purely of an exploratory nature, it should be 
emphasized ; and no pretense is made that it is possible at present to explain 
fully and in detail all the factors which result in the orderly progress of war- 
fare. 

All such endeavors, though, however bravely undertaken and whatever 
their results may be, have a theoretical origin. This research is no exception. 
In fact, it has two origins: one in the quantitative behavioral concepts of the 
psychologist S. S. Stevens (9) and the other in the cybernetic concepts of the 
mathematician Norbert Wiener (12). In essence, Stevens shows that quanti- 
fication of behavior is readily achieved by the assignment of numerals to dis- 
tinguishable behavioral events in the carcer of living systems. "These behav- 
ioral events become the system variables, which are time variant. The system 
variables, in addition, appear to have the characteristics inherent in linear 
second-order feedback-control systems of the type first conceived by Wiener. 
Furthermore, the work on the quantitative nature of warfare performed by 
L. F. Richardson (8) and F. W. Lanchester (5) supports these general ob- 
servations. 


B. Tue VrrAL Statistics or WAR 


The quantitative behavioral data which best describe modern warfare fall 
into three categories: battle strength, battle casualties, and battle deaths. By 
coincidence, these are the vital statistics of war, because nations habitually 
count their troops, assess their attrition, and record the results of combat for 
current and historical uses. From the point of view of the analyst, though, 
these particular items of data have a value in addition to their ubiquity. "They 
have an advantage over other war statistics, because they appear to be re- 
markably stable and accurate, since in most cases they have stood the scrutiny 
of countless researchers over long periods of time. 

It should be further emphasized that only official government sources are 
used for the statistics in this analysis, with two exceptions. French and Ger- 
man battle deaths and casualties during World War I were taken from Win- 
ston S. Churchill’s compilation of official records (2). Union and Confeder- 
ate statistics for the Civil War were taken from Thomas L. Livermore’s 
compilation of official records (6). 

The basic vital statistics, battle strength, battle casualties, and battle deaths, 
unless carefully defined, may mean different things to different analysts. In 
this analysis, the numbers used for all wars represent only Army personnel, 
officers and enlisted men, reported at the specified instant of time that is un- 
der consideration. This applies whether the troops are Union, Confederate, 
French, German, British, or American. In the case of the Vietnam War, 
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however, numbers of total Department of Defense forces, which includes 
Army, Air Force, Navy, Marine Corps, and Coast Guard, are used separately 
and in addition to the numbers used for the Army personnel. 

For the United States Army, battle strength is the total number of officers 
and enlisted men at any instant, /, deployed in the battle zone. For the Viet- 
nam War, the battle zone is considered to be South Vietnam, and the number 
of Army personnel in South Vietnam at any instant is considered to be the 
Army battle strength. This does not include Army personnel who are off 
shore or are in any other location, such as Thailand. 

Army battle strength for the Korean War is the total number of Army 
personnel on the Korean Peninsula at any instant (3) ; for World War I], 
it is the total number stationed outside the continental limits of the United 
States at any instant of time (4) ; for World War I it is the total number in 
France at any instant (1, p. 15). Similarly, British Army battle strength is 
the total number of British Army personnel in France at any instant of time 
during World War I (1, p. 14). Comparative statistics for German Army 
battle strength during World War I were not available in suitable form. 

For the Civil War, the Union Army battle strength is the total number of 
personnel on the Union rolls at any instant, and the Confederate Army battle 
strength is the total number of personnel on the Confederate returns at any 
instant (6, p. 47). Of all the war statistics included in this analysis, the Civil 
War strength data show the widest fluctuations and are considered the least 
reliable of all the data. 

Army battle casualties for the Vietnam War (15), the Korean War (14), 
and World War II (13) are the accumulated totals from the beginning of 
instant of time, ¢, of all Army personnel killed 
in action, dead as the result of wounds or injuries received in action, wounded 
missing in action, captured by the opposing forces, or 
the authorities of a neutral country. The 
term “in action” characterizes a casualty status in which an individual suffered 
death or injury as a direct result of enemy action, either while engaged in a 
battle, going to it, or returning from it. Excluded from United States Army 
battle casualties are injuries or deaths that resulted from exposure to the ele- 


ments or from diseases. 
Army battle deaths for the 


War II are the accumulated tota 
instant, №, of Army personnel who were killed in action, 
injuries or wounds received in action, or declared dead while missing in ac- 


tion. 


For the United States Army in World V 


the particular war up to any 


or injured in action, 
taken into custody as internees by 


Vietnam War, the Korean War, and World 


ls from the beginning of the wars up to any 
died as a result of 


Var I, the time variants in the bat- 
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tle casualties are not known. The time variants in battle deaths are known. 
Total U.S. battle casualties at the end of World War I included dead, 
wounded, missing, and prisoners (1, pp. 120-122). Similarly, mutually caused 
British versus German (16), and French versus German (2, p. 39) battle 
casualties include dead, wounded, missing, and prisoners (2). Only the U.S. 
Civil War battle casualties, which included dead and wounded, exclude miss- 
ing and prisoners because the data were unavailable (6). The interested audi- 
tors of this paper should consult the original sources cited for detailed defini- 
tions of battle strength, battle casualties, and battle deaths. 

The British-German and French-German World War I and U.S. Civil 
War vital statistics should be considered as approximations. Differences exist 
in the data for each war, which must be considered by the analyst as measure- 
ment error. "The size of the error, unfortunately, is not known at this time. 
Also, when the analyst compares the errors between wars, he should assume 
that part of the errors are due to differences in accounting procedures. The 
errors that might occur in the United States data for the Vietnam War, the 
Korean War, and World War II, however, can be considered the smallest; 
since statistics for the three wars are compiled by the same government, are 
close together in time, and major changes in accounting procedures do not 
appear to have been made. No detailed studies have been made of the exact 
accounting methods employed by the other governments because of the ex- 
ploratory nature of this research. The data have been taken more or less on 
faith from official sources, except where noted. The reader should be assured, 
however, that when conclusions are drawn, he will be alerted to the possible 
error in the base statistics so that he may discount the conclusions to the de- 
gree he chooses. 


C. METHOD or ANALYSIS 


A mere compilation in tabular form of these three items of data, arrayed 
over time, reveals very little about the progressive nature of the statistics for 
a particular war, the relationships that exist between opposing forces in a par- 
ticular war, or the relationships that exist among the statistics of different 
wars. Yet it is an identification of these significant relationships that is being 
sought in this analysis through a graphic and mathematical manipulation of 
the three sets of vital statistics. Thus, here are identified and separated out for 
examination the relationships between battle casualties and battle deaths, be- 
tween battle strength and battle casualties, between opposing forces (the ef- 
fect of their coupled statistics), and between battle strength and time. Other 
by-products, such as the effectiveness of weapons systems, can also be identi- 


fied. 
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In order to accomplish this purpose of identifying significant relationships, 
the standard scientific method of analysis is followed. The published statistics 
were gathered and put into the form dictated by the theories of Stevens and 
Wiener. For example, one important feature of the dictated form is to record 
accumulated totals over time. Any two sets of these data, thus compiled, de- 
fine coordinate points. Semilog paper is used to plot the relationship of the 
statistics to time, and log-log paper is used to plot the relationship of one vital 
statistic to another. This is done because semilog and log-log paper best illus- 
trate the respective exponential and power relationships there were found to 
exist. Finally, the data, when compiled in this manner for all five wars, are 
observed to fit very closely four simple mathematical laws. 

The connection of one data point to another on the graph paper, of course, 
results in two types of lines. The exponential functions produce curves and 
straight lines. The power functions produce straight lines. In presenting these 


ly a suitable mean line has been drawn through the data points 
red 


data graphi 
by inspection. Due to the exploratory nature of the research, it is conside 


unnecessary to employ more rigorous methods of curve fitting. 
Thus, it must be said that the basic method, or scheme of analysis, is not 


materially different from that followed in any other responsible scientific 
What is different here are the items chosen for analysis, the 
nt, and the results of their manipulation. 


Investigation. 
uniqueness of their arrangeme 


D. BATTLE DEATHS AND BATTLE CASUALTIES 


Identification of the mathematical relationship between battle deaths and 


battle casualties is the first of the four equations presented here to demon- 
strate the orderly progress of warfare. The relationship is shown in Figure 1 
by a graphic representation of a summary of the United States Army statis- 


tics for the Vietnam War, the Korean War, World War 11, World War I, 
and the Civil War (both Union and Confederate sides). In order to avoid 
a confusion of lines and symbols, however, the British and German statistics 
for World War I are not shown in Figure 1, even though they portray an 
identical relationship to the ones actually shown. 

n battle deaths and battle casualties can be de- 
pection of the plotted curves by a power 
battle deaths at any instant, 2; С is the 
nd y and 8 are constants, which as a 
weapons systems over 
ect troops. y and à 
ths and battle casu- 


This relationship betwee 
scribed mathematically from an ins 
function, D = уб, where D is the 
battle casualties at the same instant, /; a 
by-product define the net effectiveness of the enemy's 
the effectiveness of all countermeasures designed to prot 
define a family of curves, or levels, between the battle dea 
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alties for each war. This formulation permits the analyst to view a coalition 


of nations as one weapon system. 
Since the time-variant nature of the United 


States Army battle casualties 


for World War I is not known, but the final battle casualties and the time- 
variation in battle deaths are known, the slope given to the dashed line in 
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Figure 1 that is drawn from the final casualty point is the average slope, Save- 
This average slope was computed by adding the slopes for the Union and 
Confederate sides of the Civil War, the United States in World War П, 
and the United States in the two phases (to be defined later) of the Vietnam 
War and dividing the sum by five. 

Quite automatically, the formulation also establishes the upper boundary 
of enemy weapons system effectiveness. On the graphs, Figure 1 and Figure 
2, this maximum effectiveness boundary represents points where, at a given 
time, all battle casualties are also battle deaths. Hence, any data point which 
falls above this boundary represents the physically impossible condition in 
which more fatalities than casualties occur. 'The main point here is that the 
purpose of the weaponry is to destroy the enemy's fighting ability by killing 
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his troops. "Therefore, the relationship of deaths to casualties is one index of 
weapons effectiveness. The upper limit of weapons cffectiveness, by this defi- 
nition, is when casualties equal deaths. 

In no war has a sudden improvement in enemy weapons system effective- 
ness been more noticeable than in the Vietnam War, represented separately 
in Figure 2 in terms of Department of Defense battle deaths to battle casual- 
ties. Here, a smoothly progressive and constant relationship is observed to 
have been maintained from December 31, 1961, through December 31, 1964, 
which for the sake of convenience is identified as Vietnam War I. In the year 
1965, though, a significant change in the relationship occurred, which has 
held constant through June 30, 1968. This second phase of the war is identi- 
fied as Vietnam War II. 

The enemy weapons system effectiveness level on which Vietnam War I 
was fought is entirely different from the level on which Vietnam War II is 
being fought. That these two effectiveness levels are under other major Uni- 
ted States war levels is the net result of the effect of the enemy and the coun- 
tereffect of our modern medical treatment, helicopter evacuation service, and 
other new protective systems. 

These are modern times, though, and this particular enemy theoretically 
has access to an even more sophisticated weapons system than he is now em- 
ploying. It thus appears possible that the Vietnam War could suddenly esca- 
late to levels experienced during the Korean War and World War II and 
become Asian War I merely through the introduction of tanks, air power, 
missiles, and/or Chinese troops. An escalation in enemy weapons system effec- 
tiveness might also occur simultancously through the employment of Chinese- 
and Soviet-directed tactics and strategies. 

What can happen and what might happen in Vietnam in terms of battle 
deaths to battle casualties relationships, however, will lie somewhere between 
the present level and the upper boundary of weapons system effectiveness as 


shown in Figure 1 and Figure 2. The potential for change in the future, thus 
defined, lies within a limited range. 


E. BATTLE STRENGTH AND BATTLE CASUALTIES 
Identification of the ma ; И . 
We Nex ie iie pc relationship between battle strength and 
? casu: s р 
А ubi { жү Е the four equations presented to demonstrate 
the orderly progress of warfare. This relationship is shown in Figure 3 by a 
graphic representation of a summary of United States Army statistics for the 
i E e Korean Wa S 4 
Vietnam War, the Korean War, World War 11, World War I, and British 


participation in World War I. Statistics for the Civil War are not included 
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because the strength figures as a function of time for both sides are known 
to be of questionable accuracy. 

Once again a relationship pattern, one that is common to all wars analyzed, 
is immediately discernible from an inspection of the plotted curves. This pat- 
tern, too, can be described by a power function, $ = аб, where S is the to- 
tal Army strength at any instant, /; C is the Army battle casualties at the 
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same instant; a and fj are constants, which in combination define the inten- 
sity of the fighting in the battle zone. a also represents the theoretical initial 
battle zone strength at the moment the first battle casualty is sustained. In 
this sense, a is a battle-preparedness coefficient, and В is the replacement соећ- 


cient, 

"The equation, however, represents other conditions. It states that the ors 
ber of Army troops in a battle zone at a given instant is determined by the 
battle énsualties. It also states the converse. In other words, it implies that 
each new casualty causes the influx of new troops by a predictable amount 
and, conversely that new troops produce additional casualties. This is evident, 
e atical manipulation (the logarithmic derivative) is 


because if a simple mathem 

1С : 

taken, then x zs нен results. From Figure 3, 8 is seen to be less than 1, 
$ C И 

where В varies very roughly from .57 for the Korean War, .38 for World 
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War II, .37 for the British experience in World War I, and .36 for Vietnam 
War I and II. If B were zero, then battle strength would be a constant a, 
equal to the initial fighting strength. Thus £ is, then, the replacement cocffi- 
cient, where the strength, when doubled in the battle zone, causes the casual- 
ties to go up roughly by six times for all wars considered, except for the Ko- 
rean War, where they went up roughly by four times. 
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While it is acknowledged that consideration of five data points, in this case 
five wars, is not necessarily proof of the existence of a phenomenon, an ex- 
amination of the relationships between the wars suggests that two important 
conclusions can be drawn, 

The first is that there are limits in strength build-ups and casualties a na- 
tion will sustain, beyond which it either accepts defeat, changes its leadership, 
or acquires new allies. This idea is graphically illustrated in Figure 3 and 
Figure 4 by a lower boundary level, which defines the points where battle casu- 
alties and battle strengths are equal. 

This line, in effect, represents a boundary to the right of which a nation 
cannot seem to pass and still sustain its own war effort. For example, Great 
Britain at the end of World War I suffered roughly two million battle casu- 
alties with a total strength in France at the end of the war of two million 
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soldiers. Since the population of Great Britain at the time was 60 million, 
this represents 3.33 percent of her 1918 population. It is revealing to note 
that 3.33 percent of the United States population (110 million during World 
War I) is 3.67 million combat troops, or casualties, which is the point where 
the United States experience in World War I intersects the lower boundary 
when it is extended as shown by the dashed lines in Figure 3. 

Furthermore, if the two United States wars in Asia, the Korean War and 
the Vietnam War, are extended in the same manner, they intersect with each 
other and with the lower boundary level at the same point, 850,000 casualties 
and men. Also, the estimated absolute ceiling on the number of men physi- 
cally fit for active war service for the United States during World War II 
was between 15 and 16 million (7), which again is the intercept of the ex- 
tension of the United States World War II with the lower boundary level. 
This point represents 11.4 percent of the United States population of 131 
million in 1940. 

The second conclusion that can be drawn from an examination of the rela- 
tionship of battle strength to battle casualties over five wars is that there ap- 
pears to be an upper boundary level, or limit on the degree of mobilization 
that the citizens of a nation will consent to as they prepare for and partici- 
pate in a major war. This upper level becomes a sloping line when it is repre- 
sented on the graph, which in this instance is drawn backward from the point 
where all potential soldiers could become casualties. The angle of the line is 
drawn according to a slope that is the average for all wars analyzed. This 
average slope is Bave = .38. 

If one assumes that, as during World War II, 11.4 percent of the United 
States population represents the maximum number of soldiers we could pre- 
pare for combat, then our present 200 million population would produce a 
potential reservoir of 22.8 million men for the Vietnam War. The data show, 


however, that a nation does not mobilize all its potential fighting men at 
y as casualties are incurred. 


once. It characteristically mobilizes graduall 
Therefore, this gradual mobilization, mathematically represented by Bave 


-38, appears to be a behavioral characteristic of warring nations: to maint 
a balance between the maintenance of combat-eligible men in their normal 
productive activities in the society and the complete mobilization and commit- 
ment to battle of these same men in an effort to protect the nation’s way of 
life. 

Finally, when the strength to casualty rela 
nam War, as illustrated in Figure 4, it may 
activity occurred in 1965. This was the point at whic 
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tionship is applied to the Viet- 
be clearly seen that a jump in 
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calated its strength, expressed mathematically by changing а to a higher level 
of effort, in response to the change in enemy weapon system effectiveness. For 
the past two and one-half years the relationship of strength to casualties has 
not changed materially. Since the data point for June, 1968, is slightly to the 
left of the mean line, one can conclude that the United States battle strength 
is being kept down in comparison to the casualties received. "This means that 
slightly more casualties are being inflicted by the enemy than United States 
strength calls for. Furthermore, since the data point for June, 1968, is so 
close to the lower boundary level, this second phase of the Vietnam War is 
shown to be one of the most intensely fought wars the United States has ever 
experienced. 


Е. RELATIONSHIP BETWEEN OProsiNG FORCES 


Identification of the mathematical relationship between strength to strength, 


casualties to casualties, and deaths to deaths for opposing armies for a partic- 
ular war is the third of the four equations presented to demonstrate the or- 
derly progress of warfare. This relationship is shown in Figure 5 for the 
French versus German Armies, and is discernible in Figure 6 for the British 
versus the German Armics during World War I, and Figure 7 for the Union 
and Confederate Armies during the Civil War. Analyses for the other wars 
were not made, because data on enemy forces in the Vietnam Wars, and Ko- 
rean War, and World War II were cither not available or were not in suitable 
form. 


Here, too, a discernible relationship can be deduced from the resulting 
curves, which can be described by a power function, Y — jX", where X and 
Y are identical statistics at the same instant of time for opposing forces. The 
coupling coefficients are j and А. In all cases studied, however, k is found to 
be almost equal to 1. In fact, & varies between .95 and 1.0%. Thus, by setting 


dY | 
=j. 


k equal to 1, the coupling relationship assumes the simpler form of 
dX 
This equation states that if one side changes its war variables, a similar change 
occurs on the other side. Lanchester has defined j as the exchange ratio; i.e. 
the ratio between those combatants lost on one side to those combatants lost 
on the other. Also, an increase in troops on one side results in an increase in 
troops on the other. This ratio also is intuitively expected because when tw? 
armies come together in combat, firing begins almost simultaneously from 
both sides. When the combat ends, the firing ends simultaneously on both 
sides. If a large battle occurs, both sides suffer large losses at the same time. 


an. 
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This is true regardless of whether the battle is classified as a success, a fail- 


ure, a defeat, a rout, or a victory. 
Historically, the Germans have been accused of dividing their casualty sta- 
tistics in half for political purposes. This may or may not be the case. What 


WORLD WAR | - FRENCH ARMY VS GERMAN ARMY 
10,090,000 [- i — mr 
E 4 
F 31 Dec. 1918 7] 
F 81 рес. 1917 У 
Is 31 Dec. 1916 d 
O 731 Dec. 1915 
[Р - 
. 1914 _| 
= 1,000,000 |- O31 Dec. 19 | 
- Е а 
g ч 
E + 
= 
+ = 
ч 
3 F a 
5 t- Y = ix 
E + 
Е |= 2,72 
Е 
5 [Г k = 0.966 =| 
= 
o 
2. 
E | й. фы йй. Г 
тоо, ооо - 
Чоо, боо 1,000,000 10,000, 00 


GERMAN BATTLE CASUALTIES, X(t) 
FIGURE 5 
WonLp War I—FRENCH ARMY vs GERMAN ARMY 
is important is that the casualty data for both sides have essentially the same 
shape when plotted as in Figure 6. The Civil War also shows this expected 


Characteristic of similarity in shape, and even in magnitude for the casualty 
statistics, Thus the most important point to be realized here is that if the pe 
a war begins and when it might end, as indicate 

and deaths), he 


уо combatants. 


alyst is interested in when ; 
by a leveling of hostilities (measured by strengths, casualties, 
can determine this from the casualty data of only one of the tw a 
True, the other side might have a higher or lower strength ratio, or casualty 
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ratio, or kill ratio, but the leveling of hostilities occurs simultaneously and in 
like manner on both sides. The reader should be cautioned, however, that this 
conclusion is based on observation made of only two wars and within these 
two wars on only six relationships. They are Union and Confederate strength, 
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Union and Confederate casualties, Union and Confederate deaths, French and 
German casualties, British and German casualties, and British ot German 
deaths. Thus, on this basis one may conclude that opposing warring nations 
have similar behavioral patterns over time during a war even though die xà 
tios between statistics may be different. 
As this conclusion applies to the Vietnam War, one can say: If the coup- 
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ling constants do not change, then the leveling of hostilities (as measured by 
the vital statistics of warfare) is clearly discernible from the United States 
statistics alone. I£, on the other hand, the North Vietnamese escalate the war 
the first indication to the United States will be a rise in casualties above "e 
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jected levels. The coupling coefficients will then have been changed by the en- 
emy and a new and bigger war will have begun. 


С. BATTLE STRENGTH AND TIME 
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the orderly progress of warfare. Since battle deaths are related to battle casu- 
alties and in turn are related to battle strength, the relationship between ca- 
sualties and between deaths and time can be established by means of algebraic 
manipulations once the mathematical relationship between battle strength and 
time is known. 

If the behavior of warring nations can be characterized by linear second- 
order feedback-control systems equations of the type first conceived by Wiener 
and supported by the work of Richardson, Lanchester, and others (10, 11), 
then the solutions are exponential in nature. 

The general solution for a second-order system is 
where 4, B, C, Xi, and Аз are characteristic constants of the system. 
wars studied it is found that the first exponential term quickly disappears, 
while the strength is less than 10 percent of the ultimate forces. The remain- 
ing 90 percent of the ultimate fighting force is characterized by only опе time 
constant. Thus, if the analyst is concerned only with the last 90 percent of a 
war, first-order differential equations appear to describe accurately the time- 
variant nature of the vital statistics. Second-order equations appear to be 
needed only when the beginnings of wars are to be analyzed. Since the Vict- 
nam War is found to be in its terminal stages, the first 10 percent of wars 
have not been analyzed. 

To develop these ideas, one conside 
shown in Figure 8a and Figure 8b. S denotes, in this case, the 
ment of Defense strength in South Vietnam at any instant, /. The 
of battle strength, S, with time, 7, is shown in Figure 8a. It is seen that if 
the conditions which have been acting in a particular manner over the past 
three years continue in an unchanged manner, the battle strength, S, ap- 
proaches an upper limit, say S+, the ultimate number of forces that are to be 
engaged. By extrapolation in Figure 8a, one finds that S,, to a good approxi- 
mation, is 600,000 Department of Defense personnel. To obtain a simple an- 
alytical representation for the function S, one finds, as shown in Figure 8b, 
that log(S,,— 8) varies linearly with time; i.e., the curve log (S,, — 8) ver- 
sus t is a straight line. ТЕ —1/r denotes the slope of this line and if Sy denotes 
S at some initial time, to, the equation for the line is 
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Generally, for the wars analyzed here the initial strength, 80, is much less 
than the final strength, $5, at least by a factor of 10. It is, therefore, reason- 
able to assume that the ratio of initial strength to final strength is much less 
than one and the above expression may be expressed in the simpler form of 

S(t — to) 
Sy 

The constant, т, which has the dimensions of time and is referred to as the 
time constant, characterizes the variation of troop strength in the battle zone, 
S, with respect to time. For the case of Vietnam War II, as shown in Figure 
8b, one finds that т = 1.56 years; i.e., in the first 1.56 years, 63 percent of 
the troops that are to be committed to the battle zone will have been com- 
mitted. In the next 1.56 years, 86 percent of the troops will have been com- 
mitted; and in the next 1.56 years (yields a total of 4.68 years), 95 percent 
of the ultimate forces will have been committed. If the coupling relationships 
are assumed valid, then the enemy’s strength in the battle zone will have in- 
creased by the same percentages. 

In like manner, the battle strength to time relationship, plotted as the frac- 
tion of ultimate forces engaged in the battle zone, is shown in Figure 9a by 
a graphic representation of a summary of the United States statistics for the 
Vietnam War, the Korean War, World War II, and World War I (in- 
cluding the British Army). Figure 9a has been constructed in the same man- 
ner as Figure 8a in the light of the last equation. Here the strength of the 
United States Army in the battle zone has been chosen for the variable 8. Sa 
for each war is estimated. For Vietnam Wars I and II, Korean, and British 
World War I, Sa can be estimated by inspection from a graph of strength 
versus time for each war in the same manner, as $4 = 600,000 is observed 
for Department of Defense strength in Vietnam War II in Figure 8a. The 


estimated value for the United States Army in these cases is as given in Fig- 
ure 9a. 


Not so obvious is the ultimate strength that could have been mobilized by 
the United States in World War I and World War II. In these two cases 
the wars ended before the curves became sufficiently horizontal to determine 
by inspection the value of Sa. Thus, for World War II a value of 16 million 
is chosen for S+, because this was estimated to be the absolute ceiling on the 
number of men who could be considered physically fit for active war service 
at that time (7). Similarly, an S, value of 3,670,000 was chosen for United 
States participation in World War I, beéause it was the sume percentage of 
our population in France as Great Britain’s two million soldiers. in France 
was of her total population in 1918, 
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If one choose arbitrarily the initial time, ѓо, such that 85/8, is at least less 
than .10, the variation of S/S, for the different wars can then be plotted, and 
is shown in Figure 9a. The time constants for the different wars can then be 
determined. 

The last equation shows that S(t—t,) /S, is the same function of the pa- 
rameter (2—0) /т for all wars. This means that all wars can be brought to- 
gether and represented by a single line when 5/8, is plotted against (1—0) /т 
for each war. That this is true, is shown in Figure 9b for all wars analyzed. 

This derivation and demonstration, in effect, shows that if the battle 
strength is expressed in terms of the percentage of ultimate forces to be en- 
gaged and is plotted against the time constant for each particular war, all 
wars studied can be expressed by a single mathematical relationship. Thus, 
each war forms a pattern over time and then history repeats each pattern. 

Figure 9b allows comparison between the Vietnam War and all other wars 
analyzed. Where the Vietnam War stood, with respect to all other wars as 
of June 30, 1968, and every other six-month interval back to June 30, 1965, 
can be seen by the black data points. In sum it can be seen that 86 percent of 
the troops to be committed by both sides had been committed by June 30, 
1968. It can also be seen that the rate of change from now on should stay min- 
imal, providing those conditions that have been acting over the past three 
years continue to act unchanged in the future. 

Deviations of the actual data points from the theoretical curves can be 
traced to a number of causes, such as the effects of elections, weather, and in- 
fluenza epidemics. However, the major phenomenon scen here is the remark- 
able stability and trueness-to-course that, in general, wars follow over long 
periods of time. Instead of complete chaos, which is the expected characteris- 
tic of the conduct of human affairs, the exact opposite condition seems to exist 
when the human affair is warfare. When viewed in the manner demon- 
strated here, wars appear to become remarkably orderly events. 


Н. IMPLICATIONS FOR THE VIETNAM WAR 


d Ww of еюнш of our behavior, the implication of the Vietnam 
е Ы 16 і : . 3 5 
ata is that we, the United States, our allies, and our enemies, as of this writ- 


ing, are approaching a crisis poi : Á . 
g, are approaching a crisis point, where either a settlement is possible or an- 
other major escalation is indicated 


What prompts this statement is the shape of the curves for each of the pre- 
vious wars and the shape of the curves for the Vietnam War. The shape of 
the curves for the past wars, and particularly the manner in which they level 


off, suggests that peace is negotiated when death, casualty, and strength ex- 
ponentially approach the horizontal. и 
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Li 'The overall leveling of these vital statistics can be seen in the British and 
German Armies during World War I, leading to a conclusion in November, 
1918 (as shown in Figure 6) and for the Union and Confederate Armies, 
9 leading to a conclusion in April, 1865 (as shown in Figure 7). The same 
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leveling occurred in the Korean War, Figure 10, and led to à conclusion in 
| July, 1953. An identical pattern can be traced in the first phase of the Viet- 
nam War, Figure 11, which ended December, 1964. It can be further noted 
that the Korean War and the Vietnam War are relatively close together in 


time and place; thus their comparability is best of all the other wars. 
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If these variables for the second phase of the Vietnam War continue to 
level in the latter half of 1968 and during the first half of 1969, then the 
past pattern will have repeated itself for the sixth time in our history. 

As of this writing, since most of the principal nations of the world are sup- 
plying money, material, men, or some combination of the three, indeed, in re- 
ality, a global decision period is imminent. 


TRENDS IN THE VIETNAM WAR 
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" FIGURE 11 
TRENDS IN THE VIETNAM War 
I. SUMMARY 
During the last 100 years, the United States h 
to conclusion : the Civil War, World War I, World War II, and the Korean 
War. For the past eight years, we have been engaged in a fifth and not yet 
concluded war, the war in Vietnam, which has occupied our minds, our 


troops, and our resources longer than any other war in our history. And yet, 
the Vietnam War appears to be Progressing.in the s 3 


as fought four major wars 


ame orderly manner as 
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our four previous wars, despite its record duration. The repetitive behavioral 
pattern suggests that we are approaching a crisis point in the Vietnam War 
where either a settlement is possible or another major escalation is indicated. 
"This analysis is based on a purely descriptive examination of the data—the 
vital statistics—of modern nations at war, particularly the United States, our 
allies, and our enemies during the last 100 years. No attempt is made to ex- 
plain the processes which result in the behavioral orderliness. The data of all 
five wars are seen to fit very closely the same, simple, mathematical laws.? 
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3 For World War II (3), all theaters and campaigns, including the Army Air 
Corps officers and enlisted men, are grouped in U.S. Army casualties, except from 7 
December 1941 to 10 May 1942. During this period three noncombat divisions were 
captured and held for the duration of the war. This resulted in 30,838 casualties and 
13,847 deaths being listed as having occurred in the single month of May, 1942. This 
accounting procedure thus caused the only significant statistical anomaly noted in all 
the data. Hence these data were excluded. For the U.S. participation in World War 
I (1, pp. 120-122), battle deaths include only deaths from action. They do not include 

¿deaths from diseases and privation. Battle deaths for the month of May, 1917, were 
‘stimated as 400, the month of April as 100, and the month of December as 800. With 
these estimates, a 4.4 percent difference exists between total battle deaths of 48,909 
reported on page 122 and the accumulated total deaths on page 120. Total battle cas- 
ualties at the end of the war were 286,330, of which 7347 were classified as missing 
in action or taken prisoner. 230,074 were the total wounded. The rest were killed. For 
certain 0.5. Civil War battles (9, рр. 140-1), Confederate strength on the battlefield 
for a particular battle (not battle strength) had to be estimated and was assumed to 
be one-half of the known Union strength, This, in general, proved to be the case 
throughout the war. Estimates of Confederate battle casualties, when their derivation 
was necessary, were made by multiplying the Confederate battlefield strength by .123, 
which is the average Confederate casualty per soldier engaged in 30 battles of which 
12 were defeats and 18 were victories. Estimates of Confederate battle deaths, when 
made necessary, were derived by multiplying Confederate battle casualties for that 
battle by 17.6 percent, which is the average for 29 battles. Estimates of Union battle 
deaths were made by multiplying Union battle casualties by 17.5 percent, which is the 
average for 37 battles. Union and Confederate missing and prisoners were not avail- 
able. 


